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/Tumor Necrosis Factor Receptors 5, 6a & 6(J 

Field of the Invention 

The present invention relates to novel human genes encoding 
polypeptides which are members of the TNF receptor family. More 
specifically, isolated nucleic acid molecules are provided encoding human 
polypeptides named tumor necrosis factor receptor-5. -6a & -6p hereinafter 
sometimes referred to as "TNFR-5. -6a. & -6P" or generically as "TNFR 
polypeptides". TNFR polypeptides are also provided, as are vectors, host cells 
and recombinant methods for producing the same. The invention further relates 
to screening methods for identifying agonists and antagonists of TNFR 
polypeptide activity. Also provided are diagnostic and therapeutic methods 
utilising such compositions. 

Background of the invention 

Many biological actions, for instance, response to certain stimuli and 
natural biological processes, are controlled by factors, such as cytokines. Many 
cytokines act through receptors by engaging the receptor and producing an intra- 
cellular response. 

For example, tumor necrosis factors (TNF) alpha and beta are cytokines 
whicn «ct through TNF receptors to regulate numerous biological processes, 
including protection against infection and induction of shock and inflammatory 
disease. The TNF molecules belong to the "TNF-ligand" supcrtamily. and act 
together with their receptors or counter-ligands. the "TNF-receptor" 
supcrtamily. So far. nine members of the TNF ligand supcrtamily have been 
identified and ten members of the TNF-receptor supcrtamily have been 
characterized. 

Among the ligands there are included TNF-a. lymphotoxin-a tLT-a. 
also known as TNF-ph LT-p (found in complex heterotrimer LT-a2-P ». FasL 
CD40L. CD27L. CD30L. 4-1BBL. OX40L and nerve growth factor »NGF». 
The supcrtamily of TNF receptors includes the p55TNF receptor. p75TNF 
receptor. TNF receptor-related protein. FAS antigen or APO-i. CD40. CD27. 



CD30, 4-lBB, OX40. low affinity p75 and NGF-reccptor f Meager, A., 
Biologicals. 22:291-295 ( 1994)). 

Many members of the TNF-ligand superfamily are express J by- 

activated T-celis, implying that they are necessary for T-cell interactions with 
other cell types which underlie cell ontogeny and functions. (Meager. A., 
supra). 

Considerable insight into the essential functions of several members of 
the TNF receptor family has been gained from the identification and creation of 
mutants that abolish the expression of these proteins. For example, naturally 
occurring mutations in the FAS antigen and its ligand cause lymphoprolifcrative 
disease (Watanabe-Fukunaga, R.. et al.. Nature 356:314 (1992)). perhaps 
reflecting a failure of programmed cell death. Mutations of the CD40 ligand 
cause an X-linked immunodeficiency state characterized by high levels of 
immunoglobulin M and low levels of immunoglobulin G in plasma, indicating 
faulty T-cell-dependent B-cell activation (Allen, R.C. et al.. Science 259:990 
(1993)). Targeted mutations of the low affinity nerve growth factor receptor 
cause a disorder characterized by faulty sensory innovation of peripheral 
structures (Lee, K.F. et al.. Cell 69:737 ( 1992)). 

TNF and LT-a are capable of binding to two TNF receptors (the 55- 
and 75-kd TNF receptors). A large number of biological effects elicited by 
TNF and LT-a, acting through their receptors, include hemorrhagic necrosis of 
transplanted tumors, cytotoxicity, a role in endotoxic shock, inflammation, 
immunoregulation, proliferation and anti- viral responses, as well as protection 
against the deleterious effects of ionizing radiation. TNF and LT-a are involved 
in the pathogenesis of a wide range of diseases, including endotoxic shock, 
cerebral malaria, tumors, autoimmune disease, AIDS and graft-host rejection 
(Beutler, B. and Von Huffel, C. Science 264:667-668 ( 1994)). Mutations in 
the p55 Receptor cause increased susceptibility to microbial infection. 

Moreover, an about 80 amino acid domain near the C-terminus of 
TNFR1 (p55) and Fas was reported as the "death domain." which is 
responsible for transducing signals for programmed cell death (Tanaglia et al.. 
Cell 74:845 (1993)). 

Apoptosis, or programmed cell death, is a physiologic process essential 
to the normal development and homeostasis of multicellular organisms <H. 
Steiler, Science 267. 1445-1449 ( 1995)). Derangements of apoptosis 
contribute to the pathogenesis of several human diseases including cancer, 
neurodegenerative disorder* and acquired immune deficiency syndrome (C.B. 
.Thompson. Science 267, 1456-1462 ( 1995)). Recently, much attention has 
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focused on the signal transduction and biological function of two cell surface 
death receptors, Fas/APO-1 and TNFR-1 (J.L. Cleveland. J.N. Ihle. Cell 81. 
479-482 (1995); A. Fraser, G. Evan, Cell 85, 781-784 ( 1996); S. Nagata, P. 
Golstein. Science 267^, 1449-56 (1995)). Both are members of the TNF 
receptor family which also include TNFR-2, low affinity NGFR, CD40. and 
CD30. among others (C.A. Smith, et al.. Science 248, 1019-23 ( 1990); M. 
Tewari, V.M. Dixit, in Modular Texts in Molecular and Cell Biology M. 
Purton, Heldin, Carl, Ed. (Chapman and Hall. London, 1995). While family 
members are defined by the presence of cysteine-rich repeats in their 
extracellular domains, Fas/APO-1 and TNFR-1 also share a region of 
intracellular homology, appropriately designated the "death domain", which is 
distantly related to the Drosophila suicide gene, reaper (P. Golstein, D. 
Marguet, V. Depraetere, Cell 81, 185-6 (1995); K. White et a!.. Science 264. 
677-83 ( 1994)). This shared death domain suggests that both receptors interact 
with a related set of signal transducing molecules that, until recently, remained 
unidentified. Activation of Fas/APO- 1 recruits the death domain-containing 
adapter molecule FADD/MORT1 (A.M. Chinnaiyan, K. O'Rourke. M. Tewari. 
V. M. Dixit, Cell 81, 505-12 ( 1995); M. P. Boldin. et a!., J. Biol Chem 270. 
7795-8 (1995); F.C. Kischkel, et al., EMBO 14, 5579-5588 ( 1995)). which in 
turn binds and presumably activates FLICE/M ACH 1 , a member of the 
ICE/CED-3 family of pro-apoptotic proteases (M. Muzio et al.. Cell 85. 817- 
827 (1996); M.P. Boldin, T.M. Goncharov, Y.V. Goltsev, D. Wallach, Cell 
85, 803-815 (1996)). While the central role of Fas/APO-1 is to trigger cell 
death. TNFR-1 can signal an array of diverse biological activities-many of 
which stem from its ability to activate NF-kB (L.A. Tartaglia. D.V. Goeddel. 
Immunol Today 13, 151-3(1992)). Accordingly. TNFR- 1 recruits the 
multivalent adapter molecule TR ADD, which like FADD, also contains a death 
domain (H. Hsu. J. Xiong, D.V. Goeddel. Cell 81. 495-504 ( 1995); H. Hsu. 
H.-B. Shu, M.-P. Pan. D.V. Goeddel. Cell 84. 299-308 ( 1996)). Through its 
associations with a number of signaling molecules including FADD. TR AF2. 
and RIP, TR ADD can signal both apoptosis and NF-kB activation (H. Hsu, H- 
B. Shu, M.-P. Pan. D.V. Goeddel. Cell 84, 299-308 ( 1996); H. Hsu. J. 
Huang, H.-B. Shu, V. Baichwai. D.V. Goeddel, Immunity 4. 387-396 
(1996)). 




The effects of TNF family ligands and TNF family receptors are varied 
and influence numerous functions, both normal and abnormal, in the biological 

processes of the mammalian system. There is a clear need, therefore, for 
identification and characterization of such receptors and ligands that influence 
biological activity, both normally and in disease states. In particular, there is a 
need to isolate and characterize novel members of the TNF receptor family. 

Summary of the Invention 

The present invention provides isolated nucleic acid molecules 
comprising a polynucleotide encoding at least a portion of a TNFR-5, -6a or - 
6p polypeptide having the complete amino acid sequences shown in SEQ ID 
NOS:2, 4 and 6, respectively, or the complete amino acid sequence encoded by 
a cDNA clone deposited in a bacterial host as ATCC Deposit Number 97798, 
97810 and 97809, respectively. The nucleotide sequence determined by 
sequencing the deposited TNFR-5, -6a and -6P clones, which are shown in 
Figures I, 2 and 3 (SEQ ID NOS:l, 3 and 5, respectively), contain open 
reading frames encoding complete polypeptides of 259, 300 and 170 amino 
acid residues, respectively, including an initiation codon encoding an 
N-terminal methionine at nucleotide positions 183-185, 25-27, and 73-75 in 
SEQ ID NOS: I, 3, and 5 respectively. 

The TNFR proteins of the present invention share sequence homology 
with other TNF receptors. TNFR-5 shows the highest degree of sequence 
homology with the translation product for the human mRN A for nerve growth 
factor receptor (Figure 4) (SEQ ID NO:9), including multiple conserv ed 
cysticne rich domains. Splice variants TNFR-6a and -6P show the highest 
degree of sequence homology with the translation products of the human 
mRN As for TNFR-I and -II (Figure 4) (SEQ ID NOS:7 and 8, respectively) 
also including multiple conserved cysteine rich domains. 

The TNFR-5, -6a and -6P polypeptides have predicted leader 
sequences of 26, 30 and 30 amino acids, respectively; and the amino acid 
sequence of the predicted mature TNFR-5, -6a and -6P polypeptides are also 



shown in Figures 1, 2 and 3 as amino acid residues 27-259 (SEQ ID NO:2). 31- 
300 (SEQ ID NO:4), and 31-170 (SEQ ID NO:6), respectively. 

Thus, one aspect of the invention provides an isolated nucleic acid 
molecule comprising a polynucleotide having a nucleotide sequence selected 
from the group consisting of: (a) a nucleotide sequence encoding a TNFR 
polypeptide having the complete amino acid sequence in SEQ ID NO:2. 4 or 6. 
or as encoded by the cDNA clone contained in ATCC Deposit No. 97798, 
97810 or 97809; (b) a nucleotide sequence encoding a mature TNFR 
polypeptide having the amino acid sequence at positions 27-259 in SEQ ID 
NO:2, 31-300 in SEQ ID NO:4, or 31-170 in SEQ ID NO:6, or as encoded by 
the cDNA clone contained in ATCC Deposit No. 97798, 97810 or 97809; <c> 
a nucleotide sequence encoding a soluble extracellular domain of a TNFR 
polypeptide having the amino acid sequence at positions 27 to 240 in SEQ ID 
NO:2, 31 to 283 in SEQ ID NO:4. or 31 to 166 in SEQ ID NO:6, or as 
encoded by the cDN A clone contained in the ATCC Deposit No. 97798. 
97810 or 97809; and (d) a nucleotide sequence complementary to any of the 
nucleotide sequences in (a), (b), or (c) above. 

Further embodiments of the invention include isolated nucleic acid 
molecules that comprise a polynucleotide having a nucleotide sequence at least 
90% identical, and more preferably at least 95%, 96%, 97%, 98% or 99% 
identical, to any of the nucleotide sequences in (a), (b), (c) and (d) above, or a 
polynucleotide which hybridizes under stringent hybridization conditions to a 
polynucleotide in (a), (b), (c) or (d). above. This polynucleotide which 
hybridizes does not hybridize under stringent hybridization conditions to a 
polynucleotide having a nucleotide sequence consisting of only A residues or 
of only T residues. An additional nucleic acid embodiment of the invention 
relates to an isolated nucleic acid molecule comprising a polynucleotide which 
encodes the amino acid sequence of an epitope-bcaring portion of a TNFR 
polypeptide having an amino acid sequence in (a), (b) or (c). above. 



The present invention also relates to recombinant vectors, which 
include the isolated nucleic acid molecules of the present invention, and to host 
cells containing the recombinant vectors, as well as to methods of making such 
vectors and host cells and for using them for production of TNFR 
polypeptides or peptides by recombinant techniques. 

The invention further provides an isolated TNFR polypeptide 
comprising an amino acid sequence selected from the group consisting of: (a) 
the amino acid sequence of a full-length TNFR polypeptide having the 
complete amino acid sequence shown in SEQ ID NO:2, 4 or 6 or as encoded 
by the cDNA clone contained in ATCC Deposit No. 97798, 97810 or 97809; 
(b) the amino acid sequence of a mature TNFR polypeptide having the amino 
acid sequence ai positions 27-259 in SEQ ID NO:2, 31-300 in SEQ ID NO:4, 
or 3 i-170 in SEQ ID NO:6, or as encoded by the cDNA clone contained in 
ATCC Deposit No. 97798, 97810 or 97809; or (c) the amino acid sequence of 
a soluble extracellular domain of a TNFR polypeptide having the amino acid 
sequence at positions 27 to 240 in SEQ ID NO:2, 31 to 283 in SEQ ID NO:4, 
or 31 to 166 in SEQ ID NO:6, or as encoded by the cDNA clone contained in 
ATCC Deposit No. 97798, 97810 or 97809. 

The polypeptides of the present invention also include polypeptides 
having an amino acid sequence at least 80% identical, more preferably at least 
90% identical, and still more preferably 95%, 96%, 97° o, 98% or 99% identical 
to those described in (a), (b) or (c) above, as well as polypeptides having an 
amino acid sequence with at least 90° o similarity, and more preferably at least 
95% similarity, to those above. 

An additional embodiment of this aspect of the invention relates to a 
peptide or polypeptide which comprises the amino acid sequence of an 
epitope-bearing portion of a TNFR polypeptide having an amino acid 
sequence described in (a), (b) or (c). above. Peptides or polypeptides having 
the amino acid sequence of an epitope-bearing portion of a TNFR polypeptide 
of the invention include portions of such polypeptides with at least six or 



seven, preferably at least nine, and more preferably at least about 30 amino 
acids to about 50 amino acids, although epitope-bearing polypeptides of any 
length up to and including the entire amino acid sequence of a polypeptide of 
the invention described above also are included in the invention. 

In another embodiment, the invention provides an isolated antibody 
that binds specifically to a TNFR polypeptide having ah amino acid sequence 
described in (a), (b) or (c) above. The invention further provides methods for 
isolating antibodies that bind specifically to a TNFR polypeptide having an 
amino acid sequence as described herein. Such antibodies are useful 
diagnostically or therapeutically as described below. 

Tumor Necrosis Factor (TNF) family ligands are known to be among 
the most pleiotropic cytokines, inducing a large number of cellular responses, 
including cytotoxicity, anti-viral activity, immunoregulatory activities, and the 
transcriptional regulation of several genes. The invention also provides for 
pharmaceutical compositions comprising TNFR polypeptides, particularly 
human TNFR polypeptides, which may be employed, for instance, to treat 
infectious disease including HIV infection, endotoxic shock, cancer, 
autoimmune diseases, graft vs. host disease, acute graft rejection, chronic graft 
rejection, neurodegenerative disorders, myelodysplastic syndromes, ischemic 
injury, toxin-induced liver disease, septic shock, cachexia and anorexia. 
Methods of treating individuals in need of TNFR polypeptides are also 
provided. 

The invention further provides compositions comprising a TNFR 
polynucleotide or a TNFR polypeptide for administration to cells in vitro, to 
cells cx vivo and to cells in vivo, or to a multicellular organism. In certain 
particularly preferred embodiments of this aspect of the invention, the 
compositions comprise a TNFR polynucleotide for expression of a TNFR 
polypeptide in a host organism for treatment of disease. Particularly preferred 
in this regard is expression in a human patient for treatment of a dysfunction 
associated with aberrant endogenous activity of a TNFR polypeptide 





In another aspect, a screening assay for agonists and antagonists is 
provided which involves determining the effect a candidate compound has on 
TNFR polypeptide binding to a TNF-family ligand. In particular, the method 
involves contacting the TNF-family ligand with a TNFR polypeptide and a 
5 candidate compound and determining whether TNFR polypeptide binding to 

the TNF-family ligand is increased or decreased due to the presence of the 
candidate compound. In this assay, an increase in binding of a TNFR 
polypeptide over the standard binding indicates that the candidate compound 
is an agonist of TNFR polypeptide binding activity and a decrease in TNFR 
10 polypeptide binding compared to the standard indicates that the compound is 

ai. j* ^agonist of TNFR polypeptide binding activity. 

It has been discovered that TNFR- 5 is expressed not only in prostate 
tissue but also in endothelial cells, stimulated monocytes and kerotinocytes. 
TNFR-6a and -6(3 are expressed endothelial cells, keratinocytes, normal 

i5 prostate and prostate rumor tissue. For a number of disorders of theses 

tissues or cell, particularly of the immune system significantly higher or lower 
levels of TNFR gene expression may be detected in certain tissues (e.g.. 
cancerous tissues) or bodily fluids (e.g.. serum, plasma, urine, synovial fluid or 
spinal fluid) taken from an individual having such a disorder, relative to a 

20 "standard" TNFR gene expression level, i.e.. the TNFR expression level in 

healthy tissue from an individual not having the immune system disorder. 
Thus, the invention provides a diagnostic method useful during diagnosis of 
such a disorder, which involves: (a) assaying TNFR gene expression level in 
cells or body fluid of an individual: (b) comparing the TNFR gene expression 

25 level with a standard TNFR gene expression level, whereby an increase or 

decrease in the assayed TNFR gene expression level compared to the standard 
expression level is indicative of disorder in the immune system. 

An additional aspect of the invention is related to a method for treating 
- - an individual in need of an increased level of TNFR polypeptide activity in the 

*o body comprising administering to such an individual a composition comprising 
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a therapeutically effective amount of an isolated TNFR poiypeptide of the 
invention or an agonist thereof. 

A still further aspect of the invention is related to a method for treating 
an individual in need of a decreased level of TNFR polypeptide activity in the 
S body comprising, administering to such an individual a composition 

comprising a therapeutically effective amount of a TNFR antagonist. 
Preferred antagonists for use in the present invention are TNFR-speciftc 
antibodies. 

Brief Description of the Figures 

10 Figure 1 shows the nucleotide sequence (SEQ ID NO: 1 ) and deduced 

amino acid sequence (SEQ ID NO:2) of TNFR-5. 

Figure 2 shows the nucleotide sequence (SEQ ID NO:3) and deduced 
amino acid sequence (SEQ ID NO:4) of TNFR-6a. 

Figure 3 shows the nucleotide sequence (SEQ ID NO:5) and deduced 
15 amino acid sequence (SEQ ID NO:6) of TNFR-6P. 

Figure 4 shows an alignment created by the Clustal method using the 
Megaline program in the DNAstar suite comparing the amino acid sequences 
of TNFR-5 ("TNFR-likc"), TNFR-6ot ("TNFR-6a"), and TNFR-6P ("TNFR- 
6b") with other TNF receptors, as follows: TNFR I (SEQ ID NO:7); TNFR2 
20 (SEQ ID NO:K); NGFR (SEQ ID NO:9); LTbR (SEQ ID NO: 10); FAS 

(SEQ ID NO: I 1); CD27 (SEQ ID NO: 12); CD30 (SEQ ID NO: 13); CD40 
(SEQIDNO:14); 4-1 BB (SEQ ID NO: 1 5); OX40 (SEQ ID NO: 16); VC22 
(SEQIDNO:17); and CRMB (SEQ ID NO: IK). 

Figures 5, 6 and 7 show separate analyses of the TNFR-5, 6a and -60 
25 amino acid sequences, respectively. Alpha, beta, turn and coil regions; 

hydrophilicity and hydrophobicity; amphipathic regions; flexible regions; 
antigenic index and surface probability are shown. In the "Antigenic Index - 
Jameson- Wolf ' graphs, the indicate location of the highly antigenic regions ot" 
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the proteins, i.e., regions from which cpitope-bearing j. 
may be obtained. 

Figure 8 shows the nucleotide sequence of fragments related to the 
TNFR genes of the present invention, including: HPRCB54R (SEQ ID 
NO:19), HSJAU57RA (SEQ ID NO:20), HELBP70R (SEQ ID NO:21 ). and 
HUSGB54R (SEQ ID NO:22) all of which are related to SEQ ID NO: 1; and 
MELDI06R (SEQ ID NO:23) and HCEOW38R (SEQ ID NO:24) both of 
which are related to SEQ ID NOS:3 and 5. 

Detailed Description 

l0 The present invention provides isolated nucleic acid molecules 

comprising a polynucleotide encoding a TNFR-5, -6a or -6£ polypeptide, 
genencally "TNFR polypeptide^)" having the amino acid sequence shown in 
SEQ ID NOS:2. 4 and 6, respectively, which were determined by sequencing 
cloned cDNAs. The nucleotide sequences shown in Figures I, 2 and 3 (SEQ 
15 ID NOS:K 3 and 5) were obtained by sequencing the HPRCB54, HPHAE52 

and HTPCIIX4 clones, which were deposited on November 20, 1996, 
November 22. 1996, and November 22. 1996. respectively, at the American 
Type Culture Collection. 12301 Park Lawn Drive. Rockville. Maryland 
20X52, and given accession numbers ATCC 97798, 97810 and 97809, 
:o respectively. The deposited clones are contained in the pBluescnpt SK(-) 

plasmid (Stratagene. La Jolla, CA). 

The TNFR- 5 protein of the present invention has an amino acid 
sequence which is 21 .7% identical to and shares multiple conserved cysteine 
rich domains with the translation product of the human nerve growth factor 
25 (hNGF) mRNA (SEO ID NO:9> as illustrated in Figure 4. hNGF is thought to 

play an important role in the development, survival, apoptosis and function of 
ncuions (Lee, F.K. cf aL, Cell 69:737) and lymphocytes (Torcia, M. ct al.. Cell 
*5:3369 <|W6». 



The TNFR-6a and -6p proteins of the present invention are splice 
variants which share an identical nucleotide and amino acid sequence over the 
N-terminal 142 residues of the respective proteins. The amino acid sequences 
of these proteins are about 23% similar to and share multiple conserved 
cysteine rich domains with the translation product of the human TNFR-2 
mRNA (Figure 4) (SEQ ID NO:8). Importantly, these protein* share 
substantial sequence similarity over their extracellular domains including four 
repeated cysteine rich motifs with significant intersubunit homology. TNFR-2 
is thought to exclusively mediate human T-cell proliferation by TNF (PCT 
WO/94/09137). 

Nucleic Acid Molecules 

Unless otherwise indicated, all nucleotide sequences determined by 
sequencing a DNA molecule herein were determined using in automated DNA 
sequencer (such as the Model 373 from Applied Biosystems, Inc., Foster 
City, CA), and all amino acid sequences of polypeptides encoded by DNA 
molecules determined herein were predicted by translation of a DNA seqi. j ce 
determined as above. Therefore, as is known in the art for any DNA sequence 
determined by this automated approach, any nucleotide sequence determined 
herein may contain some errors. Nucleotide sequences determined by 
automation are typically at least about 90% identical, more typically at least 
about 95% to at least about 99.9% identical to the ncti.al nucleotide sequence 
of the sequence J DNA molecule. The actual sequence can be more precisely 
determined by other approaches including r.nuual DNA sequencing methods 
well known in the art. As is also known in the art, a single insertion or 
deletion in a determined nucleotide sequence compared to the actual sequence 
will cause a fra:nc shift in translation of the nucleotide sequence such that the 
predicted amino acid sequence encoded by determined nucleotide sequence 
will be completely different from the amino ^cid sequence actually encoded by 



the sequenced DNA molecule, beginning at the point of such an insertion or 
deletion. 

By "nucleotide sequence" of a nucleic acid molecule or polynucleotide 
is intended, for a DNA molecule or polynucleotide, a sequence of 
deoxyribonucleotides, and for an RNA molecule or polynucleotide, the 
corresponding sequence of ribonucleotides (A, G, C and U), where each 
thymidine deoxyribonucleotide (T) in the specified deoxyribonucleotide 
sequence is replaced by the ribonucleotide uridine (U). 

Using the information provided herein, such as the nucleotide 
sequences in Figures 1, 2 and 3 (SEQ ID NOS:l, 3 and 5), a nucleic acid 
molecule of the present invention encoding a TNFR polypeptide may be 
obtained using standard cloning and screening procedures, such as those for 
cloning cDNAs using mRNA as starting material. Illustrative of the invention, 
the TNFR-5 nucleic acid molecule described in Figure 1 was discovered in a 
cDNA library derived from prostate tissue. Additional clones of the same 
gene were also identified in cDNA libraries from the following tissues: 
endothelial cells, stimulated monocytes, and kerotinocytes. TNFR-6ct and -6[J 
clones (Figures 2 and 3, respectively) were identified in cDNA libraries from 
the following tissues: endothelial cells, keratinocytes, normal prostate tissue, 
and prostate tumor tissue. 

The determined nucleotide sequences of the TNFR cDNAs of Figures 
I, 2 and 3 (SEQ ID NOS.l, 3 and 5) contain open reading frames encoding 
proteins of 259, 300 and 1 70 amino acid residues, u tth an initiation codon at 
nucleotide positions 183-185, 25-27, 73-75 of the nucleotide sequences in 
Figures 1. 2 and 3 (SEQ ID NOS:l, 3 and 5), respectively. 

The open reading frames of the TNFR -6a and -6P genes share 
sequence homology with the translation product of the human mRNA for 
TNFR-2, including the soluble extracellular domain of about residues 3 1-283 
of SEQ ID NO:4 and 31-166 of SEQ ID NO:6, respectively. The open reading 
frame of the TNFR-5 gene shares sequence homology with the translation 
•product of the human mRNA for NGFR, including the following conserved 



domains: (a) a soluble extracellular domain of about 214 amino acids (residues 
27-240 of SEQ ID NO:2); and (b) a transmembrane domain of about 19 amino 
acids (residues 24 1 -259 of SEQ ID NO:2). 

As one of ordinary skill would appreciate, due to the possibilities of 
sequencing errors discussed above, the actual complete TNFR polypeptides 
encoded by the deposited cDNAs, which comprise about 259, 300, and 170 
amino acids, may be somewhat longer or shorter. More generally, the actual 
open reading frames may be anywhere in the range of ±20 amino acids, more 
likely in the range of ±10 amino acids, of that predicted from the first 
methionine codon from the N-terminus shown in Figures 1, 2 and 3 (SEQ ID 
NOS:l, 3 and 5), which is in-frame with the translated sequences shown in 
each respective figure. It will further be appreciated that, depending on the 
analytical criteria used for identifying various functional domains, the exact 
"address" of the extracellular and transmembrane domain(s) of the TNFR 
polypeptides may differ slightly from the predicted positions above. For 
example, the exact location of the extracellular domain in SEQ ID NO:2 may 
vary slightly (e.g., the address may "shift" by about 1 to about 20 residues, 
more likely about 1 to about 5 residues) depending on the criteria used to 
define the domain. In this case, he beginning of the transmembrane domain 
and the end of the extracellular domain were predicted on the basis of the 
identification of the hydrophobic amino acid sequence in the above indicated 
positions, as shown in Figure 5. In any event, as discussed further below, the 
invention further provides polypeptides having various residues deleted from 
the N-terminus of the complete polypeptide, including polypeptides lacking 
one or more amino acids from the N-terminus of the extracellular domain 
described herein, which constitute soluble forms of the extracellular domains of 
the TNFR-5. -6a & -6p proteins. 
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Leader an</ Mature Sequences 

The amino acid sequences of the complete TNFR proteins include a 
leader sequence and a mature protein, as shown in SEQ ID NOS2, 4 and 6: 
More in particular, the present invention provides nucleic acid molecules 
5 encoding mature forms of the TNFR proteins. Thus, according to the signal 

hypothesis, once export of the growing protein chain across the rough 
endoplasmic reticulum has been initiated, proteins secreted by mammalian cells 
have a signal or secretory leader sequence which is cleaved from the complete 
polypeptide to produce a secreted "mature" form of the protein. Most 
10 mammalian cells and even insect cells cleave secreted proteins with the same 

specificity. However, in some cases, cleavage of a secreted protein is not 
entirely uniform, which results in two or more mature species of the protein. 
Further, it has long been known that the cleavage specificity of a secreted 
protein is ultimately determined by the primary structure of the complete 
15 protein, that is, it is inherent in the amino acid sequence of the polypeptide. 

Therefore, the present invention provides a nucleotide sequence encoding a 
mature TNFR polypeptide having the amino acid sequence encoded by a 
cDNA clone contained in a host identified as ATCC Deposit No. 97798, 
97810 or 97809. By the "mature TNFR polypeptides having the amino acid 
20 sequence encoded by a cDNA clone in ATCC Deposit No. 97798, 97810. or 

978.09" is meant the mature form(s) of the protein produced by expression in a 
mammalian cell (e.g., COS cells, as described below) of the complete open 
reading frame encoded by the human DNA sequence of the clone contained in 
the vector in the deposited host. 
25 in addition, methods for predicting whether a protein has a secretory 

leader as well as the cleavage point for that leader sequence are available. For 
instance, the method of McC.eoch (Virus Res. 3:271-286 (1985)) uses the 
information from a short N-terminal charged region and a subsequent 
uncharged region of the complete (uncleaved) protein. The method of von 
yo Mcinje (Nucleic Acids Res. 74:4683-4690 (1986)) uses the information from the 
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residues surrounding the cleavage site, typically residues -13 to +2 where +1 
indicates the amino terminus of the mature protein. The accuracy of predicting 
the cleavage points of known mammalian secretory proteins for each of these 
methods is in the range of 75-80% (von Heinje, supra). However, the two 
methods do not always produce the same predicted cleavage point(s) for a 
given protein. 

In the present case, the deduced amino acid sequence of the complete 
TNFR polypeptides were »Myzed by a computer program "PSORT", 
available from Dr. Kenta . of the Institute for Chemical Research, Kyoto 
University (see K. Nakai and M. Kanehisa, Genomics 897-91 1 (1992)), 
which is an expert system for predicting the cellular location of a protein based 
on the amino acid .sequence. As part of this computational prediction of 
localization, the methods of McGeoch and von Heinje are incorporated. The 
analysis of the TNFR amino acid sequences by this program provided the 
following results: TNFR -5, -6a & -6(3 encode mature polypeptides having the 
amino acid sequences of residues 27-259, 31-300 and 31-170 of SEQ ID 
NOS:2, 4 and 6, respectively. 

As indicated, nucleic acid molecules of the present invention may be in 
the form of RNA, such as mRNA, or in the form of DNA, including, for 
instance, cDNA and genomic DNA obtained by cloning or produced 
synthetically. The DNA may be double-stranded or single-stranded. 
Single-stranded DNA or RNA may be the coding strand, also known as the 
sense strand, or it may be the non-coding strand, also referred to as the 
anti-sense strand. 

By "isolated" nucleic acid molccule(s) is intended a nucleic acid 
molecule, DNA or RNA, which has been removed from its native environment 
For example, recombinant DNA molecules contained in a vector are considered 
isolated for the purposes of the present invention. Further examples of 
isolated DNA molecules include recombinant DNA molecules maintained in 
heterologous host cells or purified (partially or substantially) DNA molecules 



in solution. Isolated RNA molecules include in vivo or in vitro RNA 
transcripts of the DNA molecules of the present invention. Isolated nucleic 
acid molecules according to the present invention further include such 
molecules produced synthetically. 

Isolated nucleic acid molecules of the present invention include DNA 
molecules comprising an open reading frame (ORF) with an initiation codon at 
positions 183-185, 25-27 73-75 of the nucleotide sequences shown in SEQ ID 
NOS:l, 3 and 5 respectively. 

Also included are DNA molecules comprising the coding sequence for 
the predicted mature TNFR polypeptides shown at positions 27-259, 31-300, 
and 31-170 of SEQ ID NOS:2, 4 and 6, respectively. 

In addition, isolated nucleic acid molecules of the invention include 
DNA molecules which comprise a sequence substantially different from those 
described above but which, due to the degeneracy of the genetic code, still 
encode a TNFR protein. Of course, the genetic code and species-specific 
codon preferences are well known in the art. Thus, it would be routine for one 
skilled in the art to generate the degenerate variants described above, for 
instance, to optimize codon expression for a particular host (e.g., change 
codons in the human mRNA to those preferred by a bacterial host such as £. 
coli). 

In another aspect, the invention provides isolated nucleic acid 
molecules encoding a TNFR polypeptide having an amino acid sequence 
encoded by the cDNA clone contained in the plasmid deposited as ATCC 
Deposit No. 97798, 97810, or 97809. Preferably, this nucleic acid molecule 
will encode the mature polypeptide encoded by the above-described deposited 
cDNA clone. 

The invention further provides an isolated nucleic acid molecule having 
the nucleotide sequence shown in Figure 1, 2 or 3 (SEQ ID NO: I, 3 or 5) or 
.the nucleotide sequence of the TNFR cDNAs contained in the above-described 




deposited clones, or a nucleic acid molecule having a sequence complementary 
to one of the above sequences. Such isolated molecules, particularly DNA 
molecules, are useful as probes for gene mapping, by in situ hybridization with 
chromosomes, and for detecting expression of the TNFR genes in human 
tissue, for instance, by Northern blot analysis. 

The present invention is further directed to nucleic acid molecules 
encoding portions of the nucleotide sequences described herein as well as to 
fragments of the isolated nucleic acid molecules described herein. In particular, 
the invention provides polynucleotides having a nucleotide sequence 
representing the portion of SEQ ID NO:l, 3 or 5 which consist of positions 
183-959, 25-924 and 73-582 of SEQ ID NOS:l, 3 and 5, respectively. Also 
contemplated are polynucleotides encoding TNFR polypeptides which lack an 
amino terminal methionine such polynucleotides having a nucleotide sequence 
representing the portion of SEQ ID NOS:l, 3 and 5 which consist of positions 
186-959, 28-924, and 76-582 of SEQ ID NOS:I,3 and 5, respectively. 
Polypeptides encoded by such polynucleotides are also provided, such 
polypeptides comprising an amino acid sequence at positions 2-259, 2-300, 
and 2-170 of SEQ ID NOS:2, 4, and 6, respectively. 

In addition, the invention provides nucleic acid molecules having 
nucleotide sequences related to extensive portions of SEQ ID NOS: 1 , 3 and 5 
which have been determined from the following related cDN A clones: 
HELBP70R (SEQ ID NO:21), HPRCB54R (SEQ ID NO: 19). HSJAU57RA 
(SEQ ID NO:20) and HUSCB54R (SEQ ID NO:22) are related to SEQ ID 
NO: 1 ; HELDI06R (SEQ ID NO:23) and HCEOW38R (SEQ ID NO:24) are 
related to SEQ ID NOS:3 and 5. The nucleotide sequences of each of these 
clones is shown in Figure 8. 

More generally, by a fragment of an isolated nucleic acid molecule 
having the nucleotide sequence of the deposited cDNA or the nucleotide 
sequence shown in Figures K 2 or 3 (SEQ ID NOS: I. 3 or 5) is intended 
fragments at least about 1 5 nt, and more preferably at least about 20 nt. still 
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more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which are useful as diagnostic probes and primers as discussed 
herein. Of course, larger fragments 50-300 nt in length are also useful 
according to the present invention as are fragments corresponding to most, if 
5 not all, of the nucleotide sequence of the deposited cDNAs or as shown in 

Figures I, 2 and 3 (SEQ ID NOS: l , 3 and 5). By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous 
bases from the nucleotide sequence of a deposited cDNA or the nucleotide 
sequence as shown in Figures l, 2 and 3 (SEQ ID NOS: I, 3 and 5). Preferred 
10 nucleic acid fragments of the present invention include nucleic acid molecules 

encoding epitope-bearing portions of the TNFR polypeptides as identified in 
Figures 5, 6 and 7 and described in more detail below. 

In another aspect, the invention provides an isolated nucleic acid 
molecule comprising a polynucleotide which hybridizes under stringent 
15 hybridization conditions to a portion of the polynucleotide in a nucleic acid 

molecule of the invention described above, for instance, a cDNA clone 
contained in ATCC Deposit No. 97798, 97810 or 97809. By "stringent 
hybridization conditions" is intended overnight incubation at 42° C in a 
solution comprising: 50% formamide, 5x SSC (150 mM NaCl, 15 mM 
20 trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardfs solution, 

10% dextran sulfate, and 20 pg/ml denatured, sheared salmon sperm DNA, 
followed by washing the filters in 0.1 x SSC at about 65° C. 

By a polynucleotide which hybridizes to a "portion" of a 
polynucleotide is intended a polynucleotide (either DNA or RNA) hybridizing 
25 to at least about 15 nucleotides (nt), and more preferably at least about 20 nt, 

still more preferably at least about 30 nt, and even more preferably about 
30-70 (e.g., 50) nt of the reference polynucleotide. The*e are u>etul as 
diagnostic probes and pnmcrs as discussed above and in more detail below. 
By a portion of a polynucleotide of "at least 20 nt in length." for 
30 - example, is .mended 20 or more contiguous nucleotides from the nucleotide 



sequence of the reference polynucleotide (e.g M a deposited cDNA or a 
nucleotide sequence as shown in Figure 1 , 2 or 3 (SEQ ID NO: 1 1 3 or 5)). Of 
course, a polynucleotide which hybridizes only to a poly A sequence (such as 
the 3' terminal poly(A) tract of a TNFR cDNA, or to a complementary stretch 
of T (or U) residues, would not be included in a polynucleotide of the 
invention used to hybridize to a portion of a nucleic acid of the invention, 
since such a polynucleotide would hybridize to any nucleic acid molecule 
containing a poly (A) stretch or the complement thereof (e.g., practically any 
double-stranded cDNA clone). 

As indicated, nucleic acid molecules of the present invention which 
encode a TNFR polypeptide may include, but are not limited to those 
encoding the amino acid sequence of the mature polypeptide, by itself; and the 
coding sequence for the mature polypeptide and additional sequences, such as 
those encoding the about 26-31 amino acid leader or secretory sequence, s»ach 
as a pre-, or pro- or prepro- protein sequence; the coding sequence of the 
mature polypeptide, with or without the aforementioned additional coding 
sequences. 

Also encoded by nucleic acids of the invention are the above protein 
sequences together with additional, non-coding sequences, including for 
example, but not limited to introns and non-coding 5* and 3* sequences, such 
as the transcribed, non-translated sequences that play a role in transcription, 
mRNA processing, including splicing and polyadenylation signals, for example 
- ribosome binding and stability of mRNA; an additional coding sequence 
which codes for additional amino acids, such as those which provide additional 
functionalities. 

Thus, the sequence encoding the polypeptide may be fused to a marker 
sequence, such as a sequence encoding a peptide which facilitates purification 
of the fused polypeptide. In certain preferred embodiments of this aspect of 
the invention, the marker amino acid sequence i* a hexa-histidine peptide, such 
as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, 
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Chatsworth, CA, 91311), among others, many of which arc commercially 
available. As described in Gentz et al t Proc. Natl Acad. Sci. USA 56:821-824 
(1989), for instance, hexa-histidine provides for convenient purification of the 
fusion protein. The "HA" tag is another peptide useful for purification which 
5 corresponds to an epitope derived from the influenza hemagglutinin protein, 

which has been described by Wilson et aL Cell 37: 767 ( 1 984). As discussed 
below, other such fusion proteins include a TNFR-5, -6a or -6f$ fused to Fc at 
the N- or C-terminus. 

Variant and Mutant Polynucleotides 

, 0 The present invention further relates to variants of the nucleic acid 

molecules of the present invention, which encode portions, analogs or 
derivatives of a TNFR polypeptide. Variants may occur naturally, such as a 
natural allelic variant. By an "allelic variant" is intended one of several 
alternate forms of a gene occupying a given locus on a chromosome of an 

15 organism. Genes //, Lewin, B., ed., John Wiley & Sons, New York (1985). 

Non-naturally occurring variants may be produced using art-known 
mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, 
deletions or additions. The substitutions, deletions or additions may involve 

20 one or more nucleotides. The variants may be altered in coding regions, 

non-coding regions, or both. Alterations in the coding regions may produce 
conservative or non-conservative amino acid substitutions, deletions or 
additions. Especially preferred among these are si'ent substitutions, additions 
and deletions, which do not alter the properties and activities of the TNFR 

25 polypeptide or portions ihereof. Also especially preferred in this regard are 

conservative substitutions. 

I lighly preferred are nucleic acid molecules encoding a mature protein 
having an amino acid sequence shown in SEQ ID NOS:2, 4 and 6 or the mature 
' * TNFR polypeptide sequences encoded by the deposited cDN A clones. 



w 



-21- ^ 



Most highly preferred are nucleic acid molecules encoding the 
extracellular domain of a protein having the amino acid sequence shown in SEQ 
ID NO:2, 4 or 6 or the extracellular domain of a TNFR amino acid sequence 
encoded by a deposited cDNA clone. 

Further embodiments include an isolated nucleic acid molecule 
comprising a polynucleotide having a nucleotide sequence at least 90% 
identical, and more preferably at least 95%, 96%, 97%, 98% or 99% identical 
to a polynucleotide selected from the group consisting of: (a) a nucleotide 
sequence encoding a TNFR polypeptide having the complete amino acid 
sequence in SEQ ID NO:2, 4 or 6, or as encoded by a cDNA c!one contained in 
ATCC Deposit No. 97798, 97810, or 97809; (b) a nucleotide sequence 
encoding a mature TNFR polypeptide having an amino acid sequence at 
positions 27-259, 31-300 or 31-170 in SEQ ID NO:2, 4 or 6, respectively, or 
as encoded by a cDNA clone contained in ATCC Deposit No. 97798, 97810 
or 97809; (c) a nucleotide sequence encoding a soluble extracellular domain of a 
TNFR polypeptide having the amino acid sequence at positions 27-240, 31- 
283, and 3 1-166 of SEQ ID NOS:2, 4 and 6, respectively; and (d) a nucleotide 
sequence complementary to any of the nucleotide sequences in (a), (b) or (c) 
above. 

Further embodiments of the invention include isolated nucleic acid 
molecules that comprise a polynucleotide having a nucleotide sequence at least 
90% identical, and more preferably at least 95%, 96%, 97%, 98% or 99% 
identical, to any of the nucleotide sequences in (a), (b), (c), or (d), above, or a 
polynucleotide which hybridizes under stringent hybridization conditions to a 
polynucleotide in (a), (b), (c), or (d), above. This polynucleotide which 
hybridizes docs not hybridize under stringent hybridization conditions to a 
polynucleotide having a nucleotide sequence consisting of only A residues or 
of only T residues. An additional nucleic acid embodiment of the invention 
relates to an isolated nucleic acid molecule comprising a polynucleotide which 



encodes the amino acid sequence of an epitope-bearing portion of a TNFR 
polypeptide having an amino acid sequence in (a), (b), (c), or (d), above. 

The present invention also relates to recombinant vectors, which 
include the isolated nucleic acid molecules of the present invention, and to host 
ceils containing the recombinant vectors, as well as to methods of making such 
vectors and host cells and for using them for production of TNFR 
polypeptides or peptides by recombinant techniques. 

By a polynucleotide having a nucleotide sequence at least, for example, 
95% "identical" to a reference nucleotide sequence encoding a TNFR 
polypeptide is intended that the nucleotide sequence of the polynucleotide is 
identical to the reference sequence except that the polynucleotide sequence 
may include up to five point mutations per each 100 nucleotides of the 
reference nucleotide sequence encoding the TNFR polypeptide. In other 
words, to obtain a polynucleotide having a nucleotide sequence at least 95% 
Mcntical to a reference nucleotide sequence, up to 5% of the nucleotides in the 
reference sequence may be deleted or substituted with another nucleotide, or a 
number of nucleotides up to 5% of the total nucleotides in the reference 
sequence may be inserted into the reference sequence. These mutations of the 
reference sequence may occur at the 5* or 3' terminal positions of the reference 
nucleotide sequence or anywhere between those terminal positions, 
inierspcrsed cither individually among nucleotides in the reference sequence or 
in one or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular nucleic acid molecule is at 
least 90%, 95%, 96%. 97° o, 9H° o or 99% identical to, for instance, a nucleotide 
sequence shown in Figure 1, 2 or 3, or to the nucleotides sequence of a 
deposited cDN A clone can be determined conventionally using known 
computer programs such as the Bestfit program (Wisconsin Sequence Analysis 
Package, Version X for Unix. Genetics Computer Group. University Research 
-Park, 575 Science Drive. Madison, Wl 5371 1 ). Bestfit uses the local 
homology algorithm of Smith and Waterman, Advumcs in Applied Mathematics 
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^ 2:482-489 ( 1 98 1 ), to find the best segment of homology between two 

sequences. When using Bestfit or any other sequence alignment program to 
determine whether a particular sequence is, for instance, 95% identical to a 
reference sequence according to the present invention, the parameters are set, 

5 of course, such that the percentage of identity is calculated over the full length 

of the reference nucleotide sequence and that gaps in homology of up to 5% of 
the total number of nucleotides in the reference sequence are allowed. 

The present application is directed to nucleic acid molecules at least 
90%, 95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequence shown 

1 o in Figure 1.2 or 3 (SEQ ID NO: 1 , 3 or 5) or to the nucleic acid sequence of a 

deposited cDNA, irrespective of whether they encode a polypeptide having 
TNFR activity. This is because even where a particular nucleic acid molecule 
does not encode a polypeptide having TNFR activity, one of skill in the art 
would still know how to use the nucleic acid molecule, for instance, as a 

1 5 hybridization probe or a polymerase chain reaction (PCR) primer. Uses of the 

nucleic acid molecules of the present invention that do not encode a 
polypeptide having TNFR activity include, inter alia, (1) isolating a TNFR 
gene or allelic variants thereof in a cDNA library; (2) in situ hybridization (e.g., 
"FISH") to metaphase chromosomal spreads to provide precise chromosomal 

20 location of the TNFR gene, as described in Verma et al.. Human Chromosomes: 

A Manual of Basic Technique*, Pergamon Press, New York (1988); and 
Northern Blot analysis for detecting TNFR mRNA expression in specific 
tissues. 

Preferred, however, are nucleic acid molecules having sequences at least 
25 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequence shown 

in Figure 1, 2 or 3 (SEQ ID NO:!, 3 or 5) or to the nucleic acid sequence of a 
deposited cDNA which do, in fact, encode polypeptides having TNFR protein 
activity. By "a polypeptide having TNFR activity" is intended polypeptides 
exhibiting activity similar, but not necessarily identical, to an activity of a 
30 mature or extracellular forms of a TNFR-5, -6a or -6(1 protein of the 



invention, as measured in a particular biological assay. The TNF family 

ligands induce various cellular responses by binding to TNF-family receptors, 

including the TNFR-5, -6a and -6(J of the present invention. Cells which 

express the TNFR proteins are believed to have a potent cellular response to 
TNFR-I receptor ligands including B lymphocytes (CD19+), both CD4 and 
CD8+ T lymphocytes, monocytes and endothelial cells. By a "cellular response 
to a TNF-family ligand" is intended any genotypic, phenotypic, and/or 
morphological change to a cell, cell line, tissue, tissue culture or patient that is 
induced by a TNF-family ligand. As indicated, such cellular responses include 
not only normal physiological responses to TNF-family ligands, but also 
diseases associated with increased cell proliferation or the inhibition of increased 
cell proliferation, such as by the inhibition of apoptosis. 

Screening assays for the forgoing are known in the art. One such 
screening assay involves the use of cells which express the receptor (fc r 
example, transfected CHO cells) in a system which measures extracellular pH 
changes caused by receptor activation, for example, as described in Science 
246:181-296 (October 1989). For example, a TNF-family ligand may be 
contacted with a cell which expresses the mature form of the receptor 
polypeptide of the present invention and a second messenger response, e.g., 
signal transduction or pH changes, may be measured to determine whether the 
TNFR polypeptide is active. 

Of course, due to the degeneracy of the genetic code, one of ordinary 
skill in the art will immediately recognize that a large number of the nucleic 
acid molecules having a sequence at least 90%, 95%, 96%, 97%, 98%, or 99% 
identical to the nucleic acid sequence of a deposited cDNA or the nucleic acid 
sequence shown in Figure 1, 2 or 3 (SEQ ID NO: I, 3 or 5) will encode a 
polypeptide "having TNFR protein activity." In fact, since degenerate 
variants of these nucleotide sequences all encode the same polypeptide, this 
will be clear ;»». sk: : Ld artisan even without performing the above described 
comparison a^y. It will be further recognized in the art that, for such nucleic 
acid molecules that arc not degenerate variants, a reasonable number will also 
encode a polypeptide having TNFR protein activity. This is because the 
skilled artisan is fully aware of ammo acid substitutions that are either less 
likely or not likely to significantly effect protein function (e.g., replacing one 
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^0 aliphatic amino acid with a second aliphatic amino acid), as further described 
below. 

Vectors and Host Cells 

The present invention also relates to vectors which include the isolated 
5 DNA molecules of the present invention, host cells which are genetically 

engineered with the recombinant vectors, and the production of TNFR 
polypeptides or fragments thereof by recombinant techniques. The vector 
may be, for example, a phage, plasmid, viral or retroviral vector. Retroviral 
vectors may be replication competent or replication defective. In the latter 
10 case, viral propagation generally will occur only in complementing host cells. 

The polynucleotides may be joined to a vector containing a selectable 
marker for propagation in a host. Generally, a plasmid vector is introduced in 
a precipitate, such as a calcium phosphate precipitate, or in a complex with a 
charged lipid. If the vector is a vims, it may be packaged in vitro using an 
15 appropriate packaging cell line and then transduced into host cells. 

The DNA insert should be operatively linked to an appropriate 
promoter, such as the phage lambda PL promoter, the £. coli lac, trp. phoA and 
tac promoters, the SV40 early and late promoters and promoters of retroviral 
LTRs, to name a few. Other suitable promoters will be known to the skilled 
20 artisan. The expression constnicts will further contain sites for transcription 

initiation, termination and, in the transcribed region, a ribosome binding site for 
translation. The coding portion of the transcripts expressed by the constructs 
will preferably include a translation initiating codon at the beginning and a 
termination codon (UAA, UGA or UAG) appropriately positioned at the end 
25 of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolate reductase, G418 or 
neomycin resistance for cukaryotic cell culture and tetracycline, kanamycin or 
ampicillin resistance genes for culturing in E. coli and other bacteria. 



Rq>resentative examples of appropriate hosts include, but are not limited to, 
bacterial cells, such as F. colt, Streptomyces and Salmonella typhimuhum cells; 
fungal cells, such as yeast cells; insect cells such as Drosophila S2 and 
Spodoptera Sf9 cells; animal cells such as CHO, COS, 293 and Bowes 
melanoma cells; and plant cells. Appropriate culture mediums and conditions 
for the above-described host cells are known in the art. 
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Among vectors preferred for use in bacteria include pQE70, pQE60 
and pQE-9, available from QI AGEN, Inc., supra: Phagescript vectors, 
Bluescript vectors, pNH8A, pNH16a, pNH18A. pNH46A, available from 
Stratagene; and ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available 
from Pharmacia. Among preferred eukaryotic vectors are pWLNFO, 
pSV2CAT, pOG44, pXTl and pSG available from Stratagene; and pSVK3. 
pBPV, pMSG and pSVL available from Pharmacia. Other suitable vectors will 
be readily apparent to the skilled artisan. Introduction of ihe construct into 
the host cell can be effected by calcium phosphate transfection, DEAE-dextran 
mediated transfection, cationic lipid-mediated transfection, electroporation, 
transduction, infection or other methods. Such methods are described in many 
standard laboratory manuals, such as Davis el aL, Basic Methods In Molecular 
Biology (1986). The polypeptide may be expressed in a modified form, such 
as a fusion protein, and may include not only secretion signals, but also 
additional heterologous functional regions. For instance, a region of additional 
amino acids, particularly charged amino acids, may be added to the N-terminus 
of the polypeptide to improve stability and persistence in the host cell, during 
purification, or during subsequent handling and storage. Also, peptide 
moieties may be added to the polypeptide to facilitate purification. Such 
regions may be removed prior to final preparation of the polypeptide. The 
addition of peptide moieties to polypeptides to engender secretion or 
excretion, to improve stability and to facilitate purification, among others, are 
familiar and routine techniques in the art. A preferred fusion protein 
comprises a heterologous region from immunoglobulin that is useful to 
stabilize and purify proteins. For example, EP-A-O 464 533 (Canadian 
counterpart 2045869) discloses fusion proteins comprising various portions of 
constant region of immunoglobulin molecules together with another human 
protein or part thereof. In many cases, the Fc part in a fusion protein is 
thoroughly advantageous for use in therapy and diagnosis and thus results, for 
cxample in improved pharmacokinetic properties (EP-A 0232 262). On the 
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other hand, for some uses it would be desirable to be able to delete the Fc part 
after the fusion protein has been expressed, detected and purified in the 
advantageous manner described. This is the case when Fc portion proves to be 
a hindrance to use in therapy and diagnosis, for example when the fusion 
protein is to be used as antigen for immunizations. In drug discovery, for 
example, human proteins, such as hIL-5, have been fused with Fc portions for 
the purpose of high-throughput screening assays to identify antagonists of 
hIL-5. See, D. Bennett et at., J- Molecular Recognition <?:52-58 (1995) and K. 
Johanson et aL J. Biol Chem, 2 70:9459-947 J (1995). 

The TNFR proteins can be recovered and purified from recombinant 
cell cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, 
affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. Most preferably, high performance liquid chromatography 
("HPLC") is employed for purification. Polypeptides of the present invention 
include: products purified from natural sources, including todily fluids, 
tissues and cells, whether directly isolated or cultured; products of chemical 
synthetic procedures; and products produced by recombinant techniques from 
a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, 
higher plant, insect and mammalian cells. Depending upon the host employed 
in a recombinant production procedure, the polypeptides of the present 
invention may be glycosylated or may be non-glycosylated. In addition, 
polypeptides of the invention may a!:o include an initial modified methionine 
residue, in «ome cases as a result of host-mediated processes. 

Polypeptides and Fragments 

The invention further provides isolated TNFR polypeptides having 
the amino acid sequences encoded by the deposited cDNAs, or the amino acid 
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sequences in SEQ ID NOS2, 4 and 6, or a peptide or polypeptide comprising 
a portion of the above polypeptides. 

Variant and Mutant Polypeptides 

To improve or alter the characteristics of a TNFR polypeptide, 
protein engineering may be employed. Recombinant DNA technology known 
to those skilled in the art can be used to create novel mutant proteins or 
"muteins" including single or multiple amino acid substitutions, deletions, 
additions or fusion proteins. Such modified polypeptides can show, e.g., 
enhanced activity or increased stability. In addition, they may be purified in 
higher yields and show better solubility than the corresponding natural 
polypeptide, at least under certain purification and storage conditions. 

N-Terminal and C-Terminal Deletion Mutants 

For instance, for many proteins, including the extracellular domain of a 
membrane associated protein or the mature form(s) of a secreted protein, it is 
known in the art that one or more amino acids may oe deleted from the N- 
terminus or C-terminus without substantial loss of biological function. For 
instance, Ron et al., J Biol Ckcm.. 265:2984-2988 (1993) reported modified 
KGF proteins that had heparin binding activity even if 3, 8, or 27 amino- 
terminal amino acid residues were missing. In the present case, since the 
proteins of the invention are members of the TNFR polypeptide family, 
deletions of N-terminal amino acids up to the Cysteine at position 53 of SEQ 
ID NO:2 (TNFR-5), and 49 of SEQ ID NOS:4 and 6 (TNFR-6a and -6p) may 
retain some biological activity such as regulation of proliferation and apoptosis 
of lymphoid cells. Polypeptides having further N-terminal deletions including 
the C53 residue in SEQ ID NO:2, or the C49 residue in SEQ ID NOS:4 and 6, 
would not be expected to retain such biological activities because it is known 
that these residues in a TNFR-related polypeptide are required for forming a 
disulfide bridge to provide structural stability which is needed for receptor 
binding and signal transduction. 
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However, even if deletion of one or more amino acids from the N- 
terminus of a protein results in modification of loss of one or more biological 
functions of the protein, other biological activities may still be retained. Thus, 
the ability of the shortened protein to induce and/or bind to antibodies which 
recognize the complete or extracellular domain of the TNFR protein generally 
will be retained when less than the majority of the residues of the complete 
protein or extracellular domain are removed from the N-terminus. Whether a 
particular polypeptide lacking N-terminal residues of a complete protein 
retains such immunologic activities can readily be determined by routine 
methods described herein and otherwise known in the art. 

Accordingly, the present invention further provides polypeptides 
having one or more residues deleted from the amino terminus of the amino acid 
sequence of the TNFR shown in SEQ ID NOS:2, 4 and 6, up to the cysteine 
residue in each which are at position numbers 53, 49 and 49, respectively, and 
polynucleotides encoding such polypeptides. In particular, the present 
invention provides TNFk-5 polypeptides comprising the amino acid sequence 
of residues m-259 of SEQ ID NO:2 where rn is an integer in the range of 1-53 
where 53 is the position of the first cysteine residue from the N-terminus of 
the complete TNFR- 5 polypeptide (shown in SEQ ID NO:2) believed to be 
required for activity of the TNFR-5 protein; and TNFR -6a and -6P 
polypeptides comprising the amino acid sequence of residues u-300 and 170 of 
SEQ ID NOS:4 and 6, respectively where n is an integer in the range of 1-49 
where 49 is the position of the first cysteine residue from the N-terminus of 
the complete TNFR-6a and -6P polypeptides (shown in SEQ ID NOS:4 and 
6, respectively) believed to be required for activity of the TNFR-6ot and -6p 
proteins. 

More in particular, the invention provides polynucleotides encoding 
polypeptides having the amino acid sequence of residues: 1-259, 2-259, 3-259, 
4-259, 5-259, 6-259, 7-259, 8-259, 9-259, 10-259, 1 1-259, 12-259, 13-259, 
14-259, 15-259, 16-259, 17-259, 18-259, 19-259, 20-259,21-259, 22-259, 23- 



-31- 





10 



15 



30 



259, 24-259, 25-259, 26-259, 27-259, 28-259, 29-259, 30-259, 31-259, 32- 
259, 33-259, 34-259, 35-259, 36-259, 37-259, 38-259, 39-259, 40-259, 41- 
259, 42-259, 43-259, 44-259, 45-259, 46-259, 47-259, 48-259, 49-259, 50- 
259, 51-259, 52-259, and 53-259 of SEQ ID NO:2; 1-300, 2-300, 3-300, 4- 
300, 5-300, 6-300, 7-300, 8-300, 9-300, 10-300, 11 -300, 12-300, 13-300, 14- 
300, 15-300, 16-300, 17-300, 18-300, 19-300, 20-300,21-300, 22-300,23- 
300, 24-300, 25-300, 26-300, 27-300, 28-300, 29-300, 30-300, 3 1 -300, 32- 
300, 33-300, 34-300, 35-300, 36-300, 37-300, 38-300, 39-300, 40-300, 41- 
300, 42-300, 43-300, 44-300, 45-300, 46-300, 47-300, 48-300, and 49-300 of 
SEQ ID NO:4; and 1-1 70, 2- 1 70, 3- 1 70, 4- 1 70, 5- 1 70, 6- 1 70, 7- 1 70, 8- 1 70, 9- 
170, 10-170, 11-170, 12-170, 13-170, 14-170, 15-170, 16-170, 17-170, 18- 
1 70, 1 9- 1 70, 20- 1 70, 2 1 - 1 70, 22- 1 70, 23- 1 70, 24- 1 70, 25- 1 70, 26- 1 70, 27- 
1 70, 28- 1 70, 29- 1 70, 30- 1 70, 3 1 - 1 70, 32- 1 70, 33- 1 70, 34- 1 70, 3 5- 1 70, 36- 
1 70, 37- 1 70, 38- 1 70, 39- 1 70, 40- 1 70, 4 1 - 1 70, 42- 1 70, 43- 1 70, 44- 1 70, 45- 
1 70, 46- 1 70, 47- 1 70, 48- 1 70, and 49- 1 70 of ^EQ ID NO:6. Polynucleotides 
encoding these polypeptides also are provided. 

Similarly, many examples of biologically functional C-terminal deletion 
muteins are known. For instance, interferon gamma shows up to ten times 
higher activities by deleting 8-10 amino acid residues from the carboxy 
terminus of the protein (Dobeli et ah, J. Biotechnology 7:199-216 (1988)). In 
the present case, since the protein of the invention is a member of the TNFR 
polypeptide family, deletions of C-terminal amino acids up to the cysteine at 
position 149, 193 and 132 of SEQ ID NOS:2, 4 and 6, respectively, may 
retain some biological activity such as regulation of proliferation and apoptosis 
of lymphoid cells. Polypeptides having further C-terminal deletions including 
the cysteines at positions 149, 193, and 132 of SEQ ID NOS:2. 4 and 6, 
respectively, would not be expected to retain such biological activities because 
it is known that these residues in TNF receptor-related polypeptides are 
required for forming disulfide bridges to provide structural stability which is 
needed for receptor binding. 
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However, even if deletion of one or more amino acids from the C- 
terminus of a protein icsults in modification of loss of one or more biological 
functions of the protein, other biological activities may still be retained. Thus, 
the ability of the shortened protein to induce and/or bind to antibodies which 
5 recognize the complete or mature form of the protein generally will be retained 

when less than the majority of the residues of the complete or mature form 
protein are removed from the C-terminus. Whether a particular polypeptide 
lacking C-terminal residues of a complete protein retains such immunologic 
activities can readily be determined by routine methods described herein and 
10 otherwise known in the art. 

Accordingly, the present invention further provides polypeptides 
having one or more residues from the carboxy terminus of the amino acid 
sequence of TNFR-5, -6a and -6p shown in SEQ ID NOS:2, 4 and 6 up to the 
cysteine at position 149, 193 and 132 of SEQ ID NOS:2, 4 and 6, 

15 respectively, and polynucleotides encoding such polypeptides. In particular, 

the present invention provides polypeptides having the amino acid sequence 
of residues 1-x, 1-y, and 1-z of the amino acid sequence in SEQ ID NOS:2, 4 
and 6, respectively, where x is any integer in the range of 149-259, where y is 
any integer in the range of 193-300, and z is any integer in the range of 1 32- 

20 1 70. Polynucleotides encoding these polypeptides also are provided. 

The invention also provides polypeptides having one or more amino 
acids deleted from both the amino and the carboxyl termini, which may be 
described generally as having residues m-x of SEQ ID NO:2, n-y of SEQ ID 
NO:4, and n-z of SEQ ID NO:6 where m, n, x, y and z are integers as 

25 described above. 

Also included are a nucleotide sequence encoding a polypeptide 
consisting of a portion of a complete TNFR ammo acid sequence encoded by a 
cDNA clone contained in ATCC Deposit No. 97798, 97810, or 97809. where 
this portion excludes from I to about 49, 53. or 53 amino acids from the amino 

30 terminus of the complete amino acid sequence encoded by the cDNA clone 
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contained in ATCC Deposit No. 97798, 97810, and 97809, respectively, or 
from 1 to about 1 10, 107, or 58 amino acids from the carboxy terminus of the 
complete amino acid sequence encoded by the cDNA clone contained in 
ATCC Deposit No. 97798, 97810 and 97809, respectively, or any 
5 combination of the above amino terminal and carboxy terminal deletions, of the 

complete amino acid sequence encoded by the cDNA clone contained in 
ATCC Deposit No. 97798, 97810 or 97809. Polynucleotides encoding all of 
the above deletion mutant polypeptide forms also are provided. 

Other Mutants 

10 In addition to terminal deletion forms of the protein discussed above, it 

also will be recognized by one of ordinary skill in the art that some amino acid 
sequences of the TNFR polypeptides can be varied without significant effect 
on the structure or function of the proteins. If such differences in sequence are 
contemplated, it should be remembered that there will be critical areas on the 

I .s protein which determine activity. 

Thus, the invention further includes variations of the TNFR 
polypeptides which show substantial TNFR polypeptide activity or which 
include regions of TNFR protein such as the protein portions discussed below. 
Such mutants include deletions, insertions, inversions, repeats, and type 

20 substitutions selected according to general rules known in the art so as have 

little effect on activity. For example, guidance concerning how to make 
phcnorypically silent amino acid substitutions is provided in Bowie, J. U. et 
al., "Deciphering the Message in Protein Sequences: Tolerance to Amino Acid 
Substitutions," Science 247: 1306-1 3 10 (1990), wherein the authors indicate 

25 that there arc two main approaches for studying the tolerance of an amino acid 

sequence to change. The first method relies on the process of evolution, in 
which mutations are either acvep'cd or rejected by natural selection The 
second approach uses genetic engineering to introduce amino acid changes at 
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specific positions of a cloned gene and selections or screens to identify " 
sequences that maintain functionality. 

As the authors state, these studies have revealed that proteins are surprisingly 
tolerant of amino acid substitutions. The authors further indicate which amino 
acid changes are likely to be permissive at a certain position of the protein. 
For example, most buried amino acid residues require nonpolar side chains, 
whereas few features of surface side chains are generally conserved. Other 
such phenotypically silent substitutions are described in Bowie, J. U. . aL 
supra, and the references cited therein. Typically seen as conservative 
substitutions are the replacements, one for another, among the aliphatic amino 
acids Ala, Val, Leu and lie; interchange of the hydroxy! residues Ser and Thr, 
exchange of the acidic residues Asp and Glu, substitution between the amide 
residues Asn and Gin, exchange of the basic residues Lys and Arg and 
replacements among the aromatic residues Phe, Tyr. Thus, the fragment, 
derivative or analog of the polypeptide of SEQ ID NO:2, 4 or 6, or that 
encoded by a deposited cDNA, may be (i) one in which one or more of the 
amino acid residues are substituted with a conserved or non-conserved amino 
acid residue (preferably a conserved amino acid residue) and such substituted 
amino acid residue may or may not be one encoded by the genetic code, or (ii) 
one in which one or more of the amino acid residues includes a substituent 
group, or (iii) one in which the mature jr soluble extracellular polypeptide is 
fused with another compound, such as a compound to increase the half-life of 
the polypeptide (for example, polyethylene glycol), or (iv) one in which the 
additional amino acids arc fused to the above form of the polypeptide, such as 
an IgG Fc fusion region peptide or leader or secretory sequence or a sequence 
which is employed for purification of the above form of the polypeptide or a 
proprotcin sequence. Such fragments, derivatives and analogs arc deemed to be 
within the scope of those skilled in the art from the teachings herein. 

Thus, the TNFR of the present invention may include one or more 
amino acid substitutions, deletions or additions, either from natural mutations 
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r- human manipulation. As indicated, changes are preferably of a minor 
nature, such as conservative amino acid substitutions that do not significantly 
affect the folding or activity of the protein (see Table 1). 
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TABLE 1. Conservative Amino Acid Substitutions. 



Aromatic 


Phenylalanine 




Tryptophan 




TvrfKtne 


Hydrophobic 


Leucine 




I v> leucine 




Valine 


Polar 


Glutamine 




Asoaraeine 


Basic 


Arginine 




Lysine 






Acidic 






Glutamic Acid 


Small 


Alanine 




Serine 




Threonine 




Methionine 




Glycine 



Amino acids in the TNFR proteins of the present invention that are 
essential for function can be identified by methods known in the art, such as 
site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham and 
Wells, Science 244: 1 08 1 - 1 085 ( 1 989)). The latter procedure introduces single 
alanine mutations at every residue in the molecule. The resulting mutant 
molecules are then tested for biological activity such as receptor binding or in 
vitro or in vitro proliferative activity. 

Of special interest are substitutions of charged amino acids with other 
charged or neutral amino acids which may produce proteins with highly 
desirable improved characteristics, such as less aggregation. Aggregation may 
not only reduce activity but also be problematic when preparing 
pharmaceutical formulations, because aggregates can be immunogenic (Pinckard 
et aL Clin. Exp, Immunol. 2:33 1-340 (1967); Robbins et aL Diabetes 36: 838- 




845 (1987); Clcland et al. Crit. Rev. Therapeutic Drug Carrier Systems 
70:307-377(1993). 

Replacement of amino acids can also change the selectivity of the 
binding of a ligand to cell surface receptor. For example, Ostade et al.. Nature 
567:266-268 (1993) describes certain mutations iesulting in selective binding 
of TNF-a to only one of the two known types of TNF receptors. Sites that 
are critical for ligand-receptor binding can also be determined by structural 
analysis such as crystallization, nuclear magnetic resonance or photoaffinity 
labeling (Smith et al, J. Mol Biol 22*899-904 (1992) and de Vos et al 
Science 255:306-3 1 2 ( 1 992)). 

Since TNFR-5, -6a and -6|J are members of the TNF receptor-related 
protein family, to modulate rather than completely eliminate biological 
activities of TNFR preferably mutations are made in sequences encoding 
amino acids in the TNFR conserved extracellular domain, more preferably in 
residues within this region which are not conserved among members of the 
TNF receptor family. Also forming part of the present invention are isolated 
polynucleotides comprising nucleic acid sequences which encode the above 
TNFR mutants. 

The polypeptides of the present invention are preferably provided in 
an isolated form, and preferably are substantially purified. A recombinantly 
produced version of the TNFR polypeptides can be substantially purified by 
the one-step method described in Smith and Johnson, Gene 67:3 1-40 (1988). 
Polypeptides of the invention also can be purified from natural or recombinant 
sources using anti-TNFR-5, -6a and -6P antibodies of the invention in 
methods which are well known in the art of protein purification. 
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The invention further provides isolated TNFR polypeptides 
comprising an amino acid sequence selected from the group consisting of (a) 
the amino acid sequence of a full-length TNFR polypeptide having the 
complete amino acid sequence shown in SEQ ID NO:2, 4 or 6 or as encoded 
by a cDNA clone contained in ATCC Deposit No. 97798, 97810 or 97809; 
(b) the amino acid sequence of a mature TNFR polypeptide having the ammo 
acid sequence at positions 27-259 in SEQ ID NO:2, 31-300 in SEQ ID NO:4. 
or 3 1 - 1 70 in SEQ ID NO:6. or as encoded by a cDNA clone contained in 
ATCC Deposit No. 97798, 97810 or 97809; or (c) the amino acid sequence of 
a soluble extracellular domain of a TNFR polypep ; having the amino acid 
sequence at positions 27 to 240 in SEQ ID NO:2, 31 to 283 in SEQ ID NO:4. 
or 3 1 to 1 66 in SEQ ID NO:6, or as encoded by the r DN A clone contained in 
ATCC Deposit No. 97798, 97810 or 97809. 

Further polypeptides of the present invention include polypeptides 
which have at least 90% similarity, more preferably at least 95% similarity, 
and still more preferably at least 96%, 97%, 98% or 99% similarity to those 
described above. The polypeptides of the invention also comprise those 
which are at least 80% identical, more preferably at least 90% or 95% 
identical, still more preferably at lens: 96%, 97%, 98% or 99% identical to the 
polypeptide encoded by the deposited cDNA or to the polypeptide of SEQ 
ID NO:2, 4 or 6, and also include portions of such polypeptides with at least 
30 amino acids and more preferably at least 50 amino acids. 

By "% similarity" for two polypeptides is intended a similarity score 
produced by comparing the amino acid sequences of the tw o polypeptides 
using the Bcstfit program (Wisconsin Sequence Analysis Package, Version 8 
for Unix, Genetics Computer Group, University Research Park, 575 Science 
Drive, Madison, WI 5371 1 ) and the default settings for determining similarity. 
Bcstfit uses the local homology algorithm of Smith and Waterman (Advances 
in Applied Mathematics 2:482-489, 1981 ) to find the best segment of 
similarity between two sequences. 




By a polypeptide having an amino acid sequence at least, for example, 
95% "identical" to a reference amino acid sequence of a TNFR polypeptide is 

intended that the amino acid sequence of-the polypeptide is. identical to .the „ 

reference sequence except that the polypeptide sequence may include up to 
five amino acid alterations per each 100 amino acids of the reference amino acid 
of the TNFR polypeptide. In other words, to obtain a polypeptide having an 
amino acid sequence at least 95% identical to a reference amino acid sequence, 
up to 5% of the amino acid residues in the reference sequence may be deleted 
or substituted with another amino acid, or a number of amino acids up to 5% 
of the total amino acid residues in the reference sequence may be inserted into 
the reference sequence. These alterations of the reference sequence may occur 
at the amino or carboxy terminal positions of the reference amino acid 
sequence or anywhere between those terminal positions, interspersed either 
individually among residues in the reference sequence or in one or more 
contiguous groups within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 
90%, 95%, 96%, 97%, 98% or 99% identical to, for instance, the amino acid 
sequence shown in SEQ ID NO:2, 4 or 6, or to an amino acid sequence 
encoded by a deposited cDNA clone can be determined conventionally using 
known computer programs such the Bestf.t program (Wisconsin Sequence 
Analysis Package, Version 8 for Unix, Genetics Computer Group, University 
Research Park, 575 Science Drive, Madison, WI 5371 1). When using Bestfit 
or any other sequence alignment program to determine whether a particular 
sequence is, for instance, 95% identical to a reference sequence according to the 
present invention, the parameters are set, of course, such that the percentage 
of identity is calculated over the full length of the reference amino acid 
sequence and that gaps in homology of up to 5% of the total number of amino 
acid residues in the reference sequence are allowed. 
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The polypeptide of the present invention could be used as a molecular 
weight marker on SDS-PAGE gels or on molecular sieve gel filtration columns 
using methods well known to those of skill in the art. 

As described in detail below, the polypeptides of the present invention 

5 can also be used to raise polyclonal and monoclonal antibodies, which are 

useful in assays for detecting TNFR protein expression as described below or 
as agonists and antagonists capable of enhancing or inhibiting TNFR protein 
function. Further, such polypeptides can be used in the yeast two-hybrid 
system to "capture" TNFR protein binding proteins which are also candidate 

10 agonists and antagonists according to the present invention. The yeast two 

hybrid system is described in Fields and Song, Nature 340:245-246 (1989). 

Epitcpe-Bearing Portions 

In another aspect, the invention provides a peptide or polypeptide 
comprising an epitope-bearing portion of a polypeptide of the invention. The 

15 epitope of this polypeptide portion is an immunogenic or antigenic epitope of 

a polypeptide of the invention. An "immunogenic epitope" is defined as a 
part of a protein that elicits an antibody response when the whole protein is 
the immunogen. On the other hand, a region of a protein molecule to which an 
antibody can bind is defined as an "antigenic epitope." The number of 

20 immunogenic epitopes of a protein generally is less than the number of 

antigenic epitopes. See, for instance, Geysen et ai. t Proc. Natl. Acad. ScL USA 
81399%- 4002(1983). 

As to the selection of peptides or polypeptides bearing an antigenic 
epitope (i.e., that contain a region of a protein molecule to which an antibody 

25 can bind), it is well known in that art that relatively short synthetic peptides 

that mimic part of a protein sequence are routinely capable of eliciting an 
antiserum that reacts with the partially mimicked protein. See, for instance, 
Sutcliffe, J. G., Shinnick, T. M., Green, N. and Learner, R. A. ( 1983) 
"Antibodies that react with predetermined sites on proteins," Science. 

30 2/9:660-666. Peptides capable of eliciting protein-reactive sera are frequently 
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reprcsented in the primary sequence of a protein, can be characterized by a \ct 
of simple chemical rules, and are confined neither to immunodominant regions 
of intact proteins (i.e., immunogenic epitopes) nor to the amino or carboxyl 
terminals. Antigenic epitope-bearing peptides and polypeptides of the 
invention are therefore useful to raise antibodies, including monoclonal 
antibodies, that bind specifically to a polypeptide of the invention. See, for 
instance, Wilson et aL Cell 57:767-778 (1984) at 777. 

Antigenic epitope-bearing peptides and polypeptides of the invention 
preferably contain a sequence of at least seven, more preferably at least nine 
and most preferably between about 15 to about 30 amino acids contained 
within the amino acid sequence of a polypeptide of the invention. 
Non-limiting examples of antigenic polypeptides or peptides that can be used 
to generate TNFR-specific antibodies include: a polypeptide comprising amino 
acid residues from abou* Gln-42 to about Glu-52, from about His-58 to about 
Cys-66, from about Pro-68 to about Thr-76, from about Ser-79 to about Cys- 
85, from about Cys-91 to about Thr-102, from about Gln-l 10 to about Pro- 
122, from about Arg-126 to about Val-136, and from about Thr-142 to about 
Glu-148 in SEQ ID NO:2; from about Ala-3 1 to about Thr-46. from about 
Phe-57 to about Thr-1 17, from about Cys-132 to about Thr-1 75, from about 
Gly-185 to about Thr-1 94, from about Val-205 to about Asp-217, from about 
Pro-239 to about Leu-264, and from about Ala-283 to about Pro-298 in SEQ 
ID NO:4; and from about Ala-3 1 to about Thr-46, from about Phe-57 to about 
Gln-80, from about Glu-86 to about His- 106, from about Thr-108 to about 
Phe-1 19, from about His- 129 to about VaI-138, and from about Gly-142 to 
about Pro-166 in SEQ ID NO:6. These polypeptide fragments have been 
determined to bear antigenic epitopes of the TNFR-5, -6a and -60 
polypeptides respectively, by the analysis of the Jameson- Wolf antigenic index, 
as shown in Figures 6 and 7, above. 

The epitope-bearing peptides and polypeptides of the invention may 
be produced by any conventional means. See, e.g., Houghten, R. A. ( 1985) 
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"General method for the rapid solid-phase synthesis of large numbers of 
peptides: specificity of antigen-antibody interaction at the level of individual 
amino acids" Proc. Natl. Acad Sci. USA *2;513l-5135; this 'Simultaneous 
Multiple Peptide Synthesis (SMPS)" process is further described in U.S. 
5 Patent No. 4,63 1.211 to Houghten et a!, ( 1 986). 

Epitope-beanng peptides and polypeptides of the invention are used 
to induce antibodies according to methods well known in the art. See, for 
instance, Sutcliffe et al., supra; Wilson et al., supra; Chow, M et aL Proc. 
Natl. Acad. Sci. USA 52:910-914; and Bittle, F. J. et al. J. Gen. Virol 
10 66:2347-2354 (1 985). Immunogenic epitope-bearing peptides of the 

invention, i.e., those parts of a protein that elicit an antibody response when 
the whole protein is the immunogen, are identified according to methods 
known in the art. See, for instance, Geysen et al., supra. Further still, U.S. 
Patent No. 5,194,392 to Geysen (1990) describes a general method of 
1 5 detecting or determining the sequence of monomers (amino acids or other 

compounds) which is a topological equivalent of the epitope (i.e., a 
"mimotope") which is complementary to a particular paratope (antigen binding 
site) of an antibody of interest. More generally, U.S. Patent No. 4,433.092 to 
Geysen (1989) describes a method of detecting or determining a sequence of 
20 monomers which is a topographical equivalent of a ligand which is 

complementary to the ligand binding site of a particular receptor of interest. 
Similarly, U.S. Patent No. 5,480,971 to Houghten, R. A. et al. (1996) on 
Peralkylated Oligopeptide Mixtures discloses linear CI -C7-alkyl peralkylated 
oligopeptides and sets and libraries of such peptides, as well as methods for 
25 using such oligopeptide sets and libraries for determining the sequence of a 

peralkylated oligopeptide that preferentially binds to an acceptor molecule of 
interest. Thus, non-peptide analogs of the epitope-bearing peptides of the 
invention also can be made routinely by these methods. 
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Fusion Proteins 

As one of skill in the art will appreciate, TNFR polypeptides of the 
present invention and the epitope-bearing fragments thereof described above 
can be combined with parts of the constant domain of immunoglobulins (IgG>, 
resulting in chimeric polypeptides. These fusion proteins facilitate 
purification and show an increased half-life in vivo. This has been shown, e.g., 
for chimeric proteins consisting of the first two domains of the human 
CD4-polypeptide and various domains of the constant regions of the heavy or 
light chains of mammalian immunoglobulins (EP A 394,827; Traunecker et aL 
Nature 331 :84-86 (1988)). Fusicn proteins that have a disulfide-linked dimeric 
structure due to the IgG part can also be more efficient in binding and 
neutralizing other molecules than the monomelic TNFR protein or protein 
fragment alone (Fountoulakis et aL J- Biochem. 270:3958-3964 (1995)). 

Antibodies 

TNFR-protein specific antibodies for use in the present invention can 
be raised against the intact TNFR-5, -6a and -60 proteins or an antigenic 
polypeptide fragment thereof, which may be presented together with a carrier 
protein, such as an albumin, to an animal system (such as rabbit or mouse) or, 
if it is long enough (at least about 25 amino acids), without a carrier. 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" 
(Mab) is meant to include intact molecules as well as antibody fragments (such 
as, for example. Fab and F(ab')2 fragments) which are capable of specifically 
binding to a TNFR protein. Fab and F(ab')2 fragments lack the fc fragment of 
intact antibody, clear more rapidly from the circulation, and may have less 
non-specific tissue binding of an intact antibody (Wahl et aL J NucL Med, 
24:316-325 (1983)). Thus, these fragments are preferred. 

The antibodies of the present invention may be prepared by any of a 
variety of methods. For example, cells expressing the TNFR protein or an 
antigenic fragment thereof can be administered to an animal in order to induce 
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the production of sera containing polyclonal antibodies. In a preferred 
method,.a preparation of TNFR protein is prepared and purified to render it 
substantially free of natural contaminants. Such a preparation is then 
introduced into an animal in order to produce polyclonal antisera of greater 
5 specific activity. 

In the most preferred method, the antibodies of the present invention 
are monoclonal antibodies. Such monoclonal antibodies can be prepared using 
hybridoma technology (Kohler et aL Nature 256:495 (1975); Kohler et aL 
Eur. J. Immunol 6:51 1 (1976); Kohler et aL, Eur. J. Immunol. 6:292 (1976); 
10 Hammerling et aL in: Monoclonal Antibodies and T-Cell Hybridomas. 

Elsevier, N.Y., (1981) pp. 563-681 ). In general, such procedures involve 
immunizing an animal (preferably a mouse) with a TNFR protein antigen or, 
more preferably, with a TNFR protein-expressing cell. Suitable cells can be 
recognized by their capacity to bind anti-TNFR-5, -6a oi -6p protein 
15 antibody. Such cells may be cultured in any suitable tissue culture medium; 

however, it is preferable to culture cells in Earths modified Eagle's medium 
supplemented with 10% fetal bovine serum (inactivated at about 56° C), and 
supplemented with about 10 g/1 of nonessential amino acids, about 1,000 U'ml 
of penicillin, and about 100 ug/ml of streptomycin. The splenocytes of such 
20 mice are extracted and fused with a suitable myeloma cell line. Any suitable 

myeloma cell line may be employed in accordance with the present invention; 
however, it is preferable to employ the parent myeloma cell line (SP20), 
available from the American Type Culture Collection, Rockville. Maryland. 
After fusion, the resulting hybridoma cells are selectively maintained in HAT 
25 medium, and then cloned by limiting dilution as cescribed by Wands et al. 

(Gastroenterology 80:225-232 ( 1981 )). The hybridoma cells obtained through 
such a selection are then assayed to identify clones which secrete antibodies 
capable of binding the desired TNFR antigen. 

Alternatively, additional antibodies capable of binding to the TNFR 
30 "antigen may be produced in a two-step procedure through the use of 
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anti-idiotypic antibodies. Such a method makes use of the fact that antibodies 
are themselves antigens, and that, therefore, it is possible to obtain an antibody 
which binds to a second antibody. In accordance with this method, TNFR- 
protein specific antibodies are used to immunize an animal, preferably a 
mouse. The spienocytes of such an animal are then used to produce 
hybridoma cells, and the hybridoma cells are screened to identify clones which 
produce an antibody whose ability to bind to the TNFR protein-specific 
antibody can be blocked by the TNFR protein antigen. Such antibodies 
comprise anti-idiotypic antibodies to the TNFR protein-specific antibody and 
can be used to immunize an animal to induce formation of further TNFP 
protein-specific antibodies. 

It will be appreciated that Fab and F(ab')2 and other fragments of the 
antibodies of the present invention may be used according to the methods 
disclosed herein. Such fragments are typically produced by proteolytic 
cleavage, using enzymes such as papain (to produce Fab fragments) or pepsin 
(to produce F(ab')2 fragments). Alternatively, TNFR protein-binding 
fragments can be produced through the application of recombinant DNA 
technology or through synthetic chemistry. 

For /it vivo use of anti-TNFR in humans, it may be preferable to use 
"humanized" chimeric monoclonal antibodies. Such antibodies can be 
produced using genetic constructs derived from hybridoma cells producing the 
monoclonal antibodies described above. Methods for producing chimeric 
antibodies are known in the art. See, for review, Morrison, Science 229: 1 202 
(1985); Oi ct al.. BioTcchniques 4:214 (1986); Cabilly et al., U.S. Patent No. 
4,816,567; Taniguchi et al., EP 171496; Morrison et al., EP 173494; Neuberger 
et al, WO 8601533; Robinson et al., WO 8702671; Boulianne ct aL Nature 
J/2:643 (1984); Neuberger et aL. Nature 5/4:268 (1985). 
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Immune System-Related Disorders 

Diagnosis 

The present inventors have discovered that TNFR-5, -6a and -6{J are 
expressed in hematopoeitic tissues. For a number of immune system-related 
disorders, substantially a!*ered (increased or decreased) levels of TNFR gene 
expression can be detected in immune system tissue or other cells or bodily 
fluids (e.g., sera and plasma) taken from an individual having such a disorder, 
relative to a "standard" TNFR gene expression level, that is, the TNFR 
expression level in immune system tissues or bodily fluids from an individual 
not having the immune system disorder. Thus, the invention provides a 
diagnostic method useful during diagnosis of an immune system disorder, 
which involves measuring the expression level of the gene encoding the TNFR 
protein in immune system tissue or other cells or body fluid from an individual 
and comparing the measured gene expression level with a standard TNFR gene 
expression level, whereby an increase or decrease in the gene expression level 
compared to the standard is indicative of an immune system disorder. 

In particular, it is believed that certain tissues in mammals with cancer 
express significantly reduced levels of the TNFR protein and mRNA encoding 
the TNFR when compared to a corresponding "standard" level- Further, it is 
believed that reduced levels of the TNFR protein can be detected in certain 
body fluids (e.g., sera and plasma) from mammals with such a cancer when 
compared to sera from mammals of the same species not having the cancer. 

Thus, the invention provides a diagnostic method useful during 
diagnosis of an immune system disorder, including cancers which involves 
measuring the expression level of the gene encoding the TNFR protein in 
immune system tissue or other cells or body fluid from an individual and 
comparing the measured gene expression level with a standard TNFR gene 
expression level, whereby an increase or decrease in the gene expression level 
compared to the standard is indicative of an immune system disorder. 





Where a diagnosis of a disorder in the immune system including 
diagnosis of i. .u^nor has already been made according to conventional methods, 
the present invention is useful as a prognostic indicator, whereby patients 
exhibiting depressed gene expression will experience a worse clinical outcome 
relative to patients expressing the gene at a level nearer the standard level. 

By "assaying the expression level of the gene encoding a TNFR 
protein" is intended qualitatively or quantitatively measuring or estimating the 
level of the TNFR-5, -6a and/or -6p protein or the level of the mRNA 
encoding the TNFR-5, -6a and/or -6p protein in a first biological sample either 
directly (e.g., by determining or estimating absolute protein level or mRNA 
level) or relatively (e.g., by comparing to the TNFR protein level or mRNA 
level in a second biological sample). Preferably, the TNFR protein level or 
mRNA level in the first biological sample is measured or estimated and 
compared to a standard TNFR protein level or mRNA level, the standard being 
taken from a second biological sample obtained from an individual not having 
the disorder or being determined by averagir i levels from a population of 
individuals not having a disorder of the immune system. As will be 
appreciated in the art, once standard TNFR protein levels or mRNA levels are 
known, they can be used repeatedly as a standard for comparison. 

By "biological sample" is intended any biological sample obtained from 
an individual, body fluid, cell line, tissue culture, or other source which 
contains TNFR protein or mRNA. As indicated, biological samples include 
body fluids (such as sera, plasma, urine, synovial fluid and spinal fluid) which 
contain free extracellular domain(s) (or soluable form(s)) of a TNFR protein, 
immune system tissue, and other tissue sources found to express complete or 
extracellular domain of a TNFR. Methods for obtaining tissue biopsies and 
body fluids from mammals are well known in the art. Where the biological 
sample is to include mRNA, a tissue biopsy is the preferred source. 

The invention also contemplates the use of a gene of the present 
invention for diagnosing mutations in a TNFR gene. For example, if a 



mutation is present in one of the genes of the present invention, conditions 
would result from a lack of production of the receptor polypeptides (if the 
present invention. Further, mutations which enhance receptor polypeptide 
activity would lead to diseases associated with an over expression of the 
receptor polypeptide, e.g., endotoxic shock. Mutations in the genes can be 
detected by comparing the sequence of the defective gene with that of a normal 
one. Subsequently one can verify that a mutant gene is associated with a 
disease condition or the susceptibility to a disease condition. That is, a muunt 
gene which leads to the undcrexpression of the receptor polypeptides of the 
present invention would be associated with an inability of TNF to inhibit 
tumor growth. 

Other immune system disorders which may be diagnosed by the 
foregoing assays include hypersensitivity, allergy, infectious disease, graft- 
host disease, immunodeficiency, autoimmune diseases and the like 

Individuals carrying mutations in the genes of the present invention 
may be detected at ;he DNA level by a variety of techniques. Nucleic acids 
used for diagnosis may be obtained from a patient's cells, such as from blood, 
urine, saliva and tissue biopsy among other tissues. The genomic DNA may 
be used directly for detection or may be amplified enzymatically by using 
PCR (Saiki et al.. Nature, 324: 163-166 (1986)) prior to anaiyv.s. RNA or 
cDN A may also be used for the same purpose. As an example. PCR primers 
complementary to the nucleic acid of the instant invention can be used to 
identify and analyze mutations in the human genes of the present invention. 
For example, deletions and insertions can be detected by a change in the size of 
the amplified product in comparison to the normal genotype. Point mutations 
can be identified by hybridizing amplified DNA to radiolabeled RNA or 
alternatively, radiolabeled antisense DNA sequences of the present invention. 
Perfectly matched sequences can be distinguished from mismatched duplexes 
by RNasc A digestion or by differences in melting temperatures. Such a 
diagnostic wouid be particularly useful for prenatal or even neonatal testing. 



Sequence differences between the reference gene and "mutants" may he 
revealed by the direct DNA sequencing method. In addition, cloned DN A 
segments may be used as probes to detect specific DNA segments. The 
sensitivity of this method is greatly enhanced when combined with PCR. For 
example, a sequencing primary used with double stranded PCR product or a 
single stranded template molecule generated by a modified PCR product. The 
sequence determination is performed by conventional procedures with 
radiolabeled nucleotides or by automatic sequencing procedures with 
fluorescent tags. 

Sequence changes at the specific locations may be revealed by 
nuclease protection assays, such as RNase and SI protection or the chemical 
cleavage method (for example. Cotton et a!., PNAS, 85:4397-4401 (1985)). 

Assaying TNFR protein levels in a biological sample can occur using 
antibody-based techniques. For example. TNFR protein expression in tissues 
can be studied with classical immunohistological methods (Jalkanen, M„ ct ai 
J. Cell. Biol. 976-985 (1985); Jalkanen, M, et ai, J. Cell. Bioi 
705:3087-3096 (1987)). Other antibody-based methods useful for detecting 
TNFR gene expression include immunoassays, such as the enzyme linked 
immunosorbent assay (EL1SA) and the radioimmunoassay (RIA). Suitable 
antibody assay labels are known in the art and include enzyme labels, such as, 
glucose oxidase, and radioisotopes, such as iodine ( i:? I, l2, I), carbon ( ,4 C). 
sulfur i"S), tritium (*H), indium ( ,i: In), and technetium (Tch and 
fluorescent labels, such as fluorescein and rhodamine, and biotin. 

In addition to assaying TNFR protein levels in a biological sample 
obtained from an individual, TNFR proteins can also be detected in vivo by 
imaging. Antibody labels or markers for in vivo imaging of TNFR proteins 
include those detectable by X-radiography, NMR or HSR. For X-radiography 
suitable labels include radioisotopes such as barium or cc:ium, which emit 
detectable radiation but are not overtly harmful to the subject. Suitable 
markers for NMR and ESR include those with a detectable characteristic spin. 



such as deuterium, which may be incorporated into the antibody by labeling of 
nutrients for the relevant hybridoma. 

A TNFR-specific antibody or antibody fragment which has been 
labeled with an appropriate detectable imaging moiety, such as a radioisotope 
(for example, m I, " 2 ln, "Tc), a radio-opaque substance, or a material 
detectable by nuclear magnetic resonance, is introduced (for example, 
parenterally, subcutaneously or intraperitoneal^) into the mammal to be 
examined for immune system disorder. It will be understood in the art that the 
size of the subject and the imaging system used will determine the quantity of 
imaging moiety needed to produce diagnostic images. In the case of a 
radioisotope moiety, for a human subject, the quantity of radioactivity injected 
will normally range from about 5 to 20 millicuries of Tc. The labeled 
antibody or antibody fragment will then preferentially accumulate at the 
location of cells which contain TNFR protein, in vivo tumor imaging is 
described in S.W. Burchiel et al., "Immunopharmacokinetics of Radiolabeled 
Antibodies and Their Fragments" (Chapter 13 in Tumor Imaging: The 
Radiochemical Detection of Cancer, S.W. Burchiel and B. A. Rhodes, eds., 
Masson Publishing Inc. (1982)). 

Treatment 

The Tumor Necrosis Factor (TNF) family ligands are known to be 
among the most pleiotropic cytokines, inducing a large number of cellular 
responses, including cytotoxicity, anti-viral activity, immunoregulatory 
activities, and the transcriptional regulation of several genes (Gocddel. D.V. 
a/., "Tumor Necrosis Factors: Gene Structure and Biological Activities," 
Symp. Quant Biol. 57:597-609 (1986). Cold Spring Harbor; Beutler. B., and 
Cerami, A., Anna. Rev. Biochcm. 57:505-518 (1988); Old, L.J., Sci. Am. 
25*59-75 (1988); Fiers, W\, FEBS Lett. 2*5:1.99-224 (1991)). The TNF- 
family ligands induce such various cellular responses by binding to TNF- 
family receptors. Cells which express a TNFR polypeptide and have a potent 
cellular response to TNFR-5, -6a and 6(3 ligands include lympnocytes. 
endothelial cells, kcratinocytes, and prostate tissue. By "a cellular response to 
a TNF-family liganJ" is intended any genotypic, phenotypic. and or 



morphologic change to a cell, cell line, tissue, tissue culture or patient that is 
induced by a TNF-family ligand. As indicated, such cellular responses include 
not only normal physiological responses to TNF-family Iigands, but also 
diseases associated with increased apoptosis or the inhibition of apoptosis. 

Diseases associated with increased cell survival, or the inhibition of 
apoptosis, include cancers (such as follicular lymphomas, carcinomas with p53 
mutations, and hormone-dependent tumors, such as breast cancer, prostate 
canc.r, Kaposi's sarcoma and ovarian cancer); autoimmune disorders (such as 
systemic lupus erythematosus and immune-related glomerulonephritis 
rheumatoid arthritis) and viral infections (such as herpes viruses, pox viruses 
and adenoviruses), information graft v. host disease, acute graft rejection, and 
chronic graft rejection. Diseases associated with increased apoptosis include 
AIDS; neurodegenerative disorders (such as Alzheimer's disease, Parkinson's 
disease. Amyotrophic lateral sclerosis. Retinitis pigmentosa. Cerebellar 
degeneration); myelodysplasia syndromes (such as aplastic anemia), ischemic 
injury (such as that caused by myocardial infarction, stroke and reperfusion 
injury), toxin-induced liver disease (such as that caused by alcohol), septic 
shock, cachexia and anorexia. 

Thus, in one aspect, the present invention is directed to a method for 
enhancing apoptosis induced by a TNF-family ligand, which involves 
administering to a cell which expresses the TNFR polypeptide an effective 
amount of TNFR polypeptide, analog or an agonist capable of increasing 
TNFR mediated signaling. Preferably, TNFR mediated signaling is increased 
to treat a disease wherein decreased apoptosis is exhibited. Antagonist can 
include soluble forms of TNI- R and monoclonal antibodies directed against the 
TNFR polypeptide. 

By "agonist" is intended naturally occurring and synthetic compounds 
capable of enhancing or potentiating apoptosis. By "antagonist" is intended 
naturally occurring and synthetic compounds capable of inhibiting apoptosis. 
Whether any candidate "agonist" or "antagonist" of the present invention can 
enhance or inhibit apoptosis can be determined using art-known TNF-family 
ligand receptor cciiular response as;;a*s, including those described in more 

detail below. I 

One such screening procedure involves the use of mclanophorcs which 
arc transfectcd to express the receptor of the present invention. Such a 
screening technique is described in PCT W u 92 OIK 10. published. February 6. 
1992. Such an assay may be employed, for example, for screening for a 
compound which inhibits (or enhances) activation of the receptor polypeptide 
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of the present invention by contacting the melanophore cells which encode the 
receptor with both a TNF-family ligand and the candidate antagonist (or 
agonist). Inhibition or enhancement of the signal generated by the ligand 
indicates that the compound is an antagonist or agonist of the hgand/receptor 
5 signaling pathway. 

Other screening techniques include the use of ceLs which express the 
receptor (for example, transfected CHO cells) in a system which measures 
extracellular pH changes caused by receptor activation, for example, as 
described in Science 246:181-296 (October 1989). For example, compounds 
10 *riay be contacted with a cell which expresses the receptor polypeptide of the 

present invention and a second messenger response, e.g.. signal transduction or 
pH changes, may be measured to determine whether the potential compound 
activates or inhibits the receptor. 

Another such screening technique involves introducing RNA encoding 
15 the recepior into Xenopus oocytes to transiently express the receptor. The 

receptor oocytes ma> then be contacted with the receptor ligand and a 
compound to be screened, followed by detection of inhibition or activation of a 
calcium signal in the case of screening for compounds which are thought to 
inhibit activation of the receptor. 
20 Another screening technique involves expressing in cells a construct 

wherein the receptor is linked to a phospholipasc C or D. Such cells include 
endothelial cells, smooth muscle cells, embryonic kidney cells, etc. The 
screening may \*e accomplished as hereinabove described by detecting 
activation of the receptor or inhibition of activation of the receptor from the 
25 phospholipase signal. 

Anothc. method involves screening for compounds which inhibit 
activation of the receptor polypeptide of the present invention antagonists by 
determining inhibition of binding of labeled ligand to cells which have the 
receptor on the surface thereof. Such a method involves transfecting a 
30 cukaryotic cell with DNA encoding the receptor such that the cell expresses 

the receptor on its surface and contacting the cell with a compound in the 
presence of a labeled form of a known ligand. The ligand can be labeled, e.g., 
by radioactivity. The amount of labeled ligand bound to the receptors is 
measured, e.g., by measuring radioactivity of the receptors. If the compound 
35 binds to the receptor as determined by a reduction of labeled ligand which 

binds to the receptors, the binding of labeled ligand to the receptor is inhibited. 
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Further screening assays for agonist and antagonist of the present 
invention are described in Tartaglia, L.A., and Goeddel. D.V., J Biol Chem 
20-7^:4304-4307(1992). 

Thus, in a further aspect, a screening method is provided for 
5 determining whether a candidate agonist or antagonist is capable of enhancing 

or inhibiting a cellular response to a TNF-family iigand. The method involves 
contacting cells which express the TNFR polypeptide with a candidate 
compound and a TNF-family Iigand, assaying a cellular response, and 
comparing the cellular response to a standard cellular response, the standard 
10 being assayed when contact is made with the Iigand in ?bsvnce of the candidate 

compound, whereby an increased cellular response over the standard indicates 
that the candidate compound is an agonist of the ligand/receptor signaling 
pathway and a decreased cellular response compared to the standard indicates 
that the candidate compound is an antagonist of the ligand/receptor signaling 
15 pathway. By "assaying a cellular response" is intended qualitatively or 

quantitatively measuring a cellular response to a candidate compound and/or a 
TNF-family Iigand (e.g., determining or estimating an increase or decrease in T 
cell proliferation or tritiated thymidine labeling). By the invention, a cell 
expressing the TNFR polypeptide can be contacted with either an endogenous 
20 or exogenousty administered TNF-family Iigand. 

Agonist according to the present invention include naturally occurring 
and synthetic compounds such as, for example, TNF family Iigand peptide 
fragments, transforming growth factor, neurotransmitters (such as glutamate, 
dopamine, A'-methyl-D-aspartate), tumor suppressors (p53). cytolytic T cells 
25 and antimetabolites. Preferred agonist include chemothcrapeutic drugs such as, 

for example, cisplatin, doxorubicin, bleomycin, cytosine arabinoside, nitrogen 
mustard, methotrexate and vincristine. Others include ethanol and -amyloid 
peptide. (Science 267: 1457-145K (1995)). Further preferred agonist include 
polyclonal and monoclonal antibodies raised against the TNFR polypeptide. 
30 or a fragment thereof. Such agonist antibodies raised against a TNF family 

receptor arc disclosed in Tartaglia, L.A., et aL. Proc. Sad. Acad. Sci USA 
W:9292-9296 (1991); and Tartaglia. L.A., and Goeddel, D.V., J. Bioi Chem. 
«?/57(7):43O4-4307 (1992) Sec, also. PCT Application WO 94 09137. 

Antagonist according to the present invention include naturally 
35 recurring and synthetic compounds such as. for example, the CD40 Iigand, 

neutral amino acids, zinc, estrogen, androgens, viral genes (such as Adenovirus 
E1B. Baeulovirus f>35 and ZAP, Cowpox virus <-#7#i.-l, Hpstcin-BarT virus 
Bi?%Fl. LMP-I. African swine fever virus Z.A/H5-//L. and Herpesvirus yl 
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34.5), calpain inhibitors, cysteine protease inhibitors, and tumor promoters 
(such as PMA, Phenobarbital. and -Hexachlorocyclohexane). Other 
antagonists include polyclonal and monoclonal antagonist antibodies raised 
against the TNFR polypeptides or a fragment thereof. Such antagonist 
5 antibodies raised against a TNF-family receptor are described in Tartaglia. 

L.A., and Goeddel, D V., J. Biol. Chem. 267(7) :4304-4307 (1992) and 
Tartaglia, L.A. et aL Cell 73:2 1 3-216 (1993). See, also, PCT Application WO 
94/09137. 

Other potential antagonists include antisense molecules. Antisense 
10 technology can be used to control gene expression through antisense DNA or 

RNA or through triple-helix formation. Antisense techniques are discussed, 
for example, in Okano, . Neurochem. 5/5:560 < 1991); Oligodeoxynuck-otides as 
Antisense Inhibitors of Gene Expression. CRC Press, Boca Raton, PL (1988). 
Triple helix formation is discussed in, for instance Lee et aL Nucleic Acids 
15 Research 6:3073 (1979); Cooney et aL Science 241:456 (1988); and Dervan et 

aL Science 257:1360 (1991). The methods are based on binding of a 
polynucleotide to a complementary DNA or RNA. 

For example, the 5' coding portion of a polynucleotide that encodes the 
mature polypeptide of the present invention may be used to design an 
20 antisense RNA oligonucleotide of from about 10 to 40 base pairs in length. A 

DNA oligonucleotide is designed to be complementary to a region of the gene 
involved in transcription thereby preventing transcription and the production 
of the receptor. The antisense RNA oligonucleotide hybridizes to the mRNA 
in vivo and blocks translation of the mRNA molecule into receptor 
25 polypeptide. The oligonucleotides described above can also be delivered to 

cells such that the antisense RNA or DNA may be expressed in vivo to inhibit 
production of the receptor. 

Further antagonist according to the present invention include soluble 
forms of TNFR, i.e., TNFR fragments that include the ligand binding domain 
30 from the extracellular region of the full length receptor. Such soluble forms of 

the receptor, which may be naturally occurring or synthetic, antagonize TNFR 
mediated signaling by competing with the cell surface TNFR for binding to 
TNF-family ligands. Thus, soluble forms of the receptor that include the 
ligand binding domain arc novel cytokines capable of inhibiting tumor necrosis 
35 induced by TNF-family ligands. Other such cytokines arc known in the art 

and include Fas B (a soluble form of the mouse Fas receptor) that acts 
physiologically to limit apoptosis induced by Fas ligand (Hughes. D P. and 
Xnspc, I.N.../. Exp. Med. i*2: 1 395- 140 1 (I995». 
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As indicated polyclonal and monoclonal antibody agonist or antagonist 
according to the present invention can be raised according to the methods 
disclosed in Tartaglia. L.A., and Goeddel. D.V.. J Biol. Chem 267(7, 4**- 
4307(1992); Tartaglia. L A. ct at.. Cell 7J:2I3-2I6 (1993). and PCT 
Application WO 94/09137. The term "antibody" (Ab) or "monoclonal 
anti'sody" (mAb) as used herein is meant to include intact molecules as well as 
fragments thereof (such as. for example. Fab and F(ab') 2 fragments) which are 
capable of binding an antigen. Fab and F (ab') 2 fragments lack the Fc fragment 
of intact antibody, clear more rapidly from the circulation, and may have less 
non-specific tissue binding of an intact antibody (Wahl el at.. J Nucl. Med. 

24:1 16-325 (1983)). 

Antibodies according to the present invention .nay be prepared by any 
of a variety of methods described above, and known in the art 

Proteins and other compounds which bind the extracellular domains are 
also candidate agonist and antagonist according to the present invention. Such 
binding compounds can be "captured" using the yeast two-hybrid system 
(Fields and Song. Nature 540:245-246 ( 1989)). A modified version of the yeast 
two-hybrid system has been described by Roger Brent and his colleagues 
(Gyuris. J. el ai. Cell 75:791-803 (1993); Zervos, A.S. el at.. Cell 72:223-232 
(1993)). 

By a "TNF-family ligand" is intended naturally occurring, recombinant, 
and synthetic ligands that are capable of binding to a member of the TNF 
receptor family and inducing the ligand'rcceptor signaling pathway. Members 
of the TNF ligand family include, but are not limited to, the TNFR-5. -6a & - 
6|3 ligands. TNF-a . lymphdtoxin-a (LT-a . also known as TNF-p ). LT-p\ 
FasL. CD40. CD27. CD30. 4-IBB. OX40 and nerve growth factor (NGF). 

Representative therapeutic applications of the present invention are 
discussed in-more detail below. The state of immunodeficiency that defines 
AIDS is secondary to a decrease in the number and function of CD4" 
T-lymphocytes. Recent reports estimate the daily loss of CD4" T cells to be 
between 3.5 X 107 and 2 X 109 cells (Wei X.. el ai. Suture J7J.1 17-122 
(1995)). One cause of CD4' T cell depletion in the setting of HIV infection is 
believed to be IllV-induccd apopiosis. Indeed. HIV-induccd apoptotic cell 
death has been demonstrated not only in vitro but also, more importantly, in 
infected individuals (Ameiscn. J.C.. AIDS *:\ 197-1213 (1994) ; Kinkel. T.U.. 
and Banda, N.K., Curr. Opin. Immunol. rt:605-615( 1995); Muro-Cacho. C A 
etal .J. Immunol. /5-/:5555-5566 ( 1995)). Furthermore, apoptosis and CW 
- T-lyihphocytc depletion is tightly correlated in differcn. animal models of 
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AIDS (Brunner, T., et al„ Nature 373:441-444 (1995); Gougeon, ML., et aL 
AIDS Res. Hum. Retroviruses 9:553-563 (1993)) and, apoptosis is not 
observed in those animal models in which viral replication does not result in 
AIDS (Gougeon, ML. et at.. AIDS Res. Hum. Retroviruses 9:553-563 (1993)). 

5 Further data indicates that uninfected but primed or activated T lymphocytes 

from HIV-infected individuals undergo apoptosis after encountering the TNF- 
family ligand FasL. Using monocytic cell lines that result in death following 
HIV infection, it has been demonstrated that infection of U937 cells with HIV 
results in the de novo expression of FasL and that FasL mediates HIV-induced 

10 apoptosis (Badley, A.D. et al. J. Virol. 70:199-206 (1996)). Further the 

TNF-family ligand was detectable in uninfected macrophages and its 
expression was upregulated following HIV infection resulting in selective 
killing of uninfected CD4 T-lymphocytes (Badley, A.D et al., J. Virol 70: 1 99- 
206 (1996)). Thus, by the invention, a method for treating HIV* individuals is 

15 provided which involves administering an antagonist of the present invention 

to reduce selective killing of CD4 T-lymphocytes. Modes of administration 
and dosages are discussed in detail below. 

In rejection of an allograft, the immune system of the recipient animal 
has not previously been primed to respond because the immune system for the 

20 most part is only primed by environmental antigens. Tissues from other 

members of the same species have not been presented in the same way that, 
for example, viruses and bacteria have been presented. In the cast of allograft 
rejection, immunosuppressive regimens are designed to prevent the immune 
system from reaching the effector stage. However, the immune profile of 

25 xenograft rejection may resemble disease recurrence more that allograft 

rejection. In the case of disease recurrence, the immune system has already 
been activated, as evidenced by destruction of the native islet cells. Therefore, 
in disease recurrence the immune system is already at the effector stage. 
Agonist of the present invention are able to suppress the immune response to 

30 both allografts and xenografts because lymphocytes activated and 

differentiated into effector cells will express the TNFR polypeptide, and 
thereby are susceptible to compounds which enhance TNFR activity. Thus, 
the present invention further provides a method for creating immune privileged 
tissues. Antagonist of the invention can further be used in the treatment of 

35 Inflammatory Bowel-Disease. 
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Formulations 

The TNFR polypeptide composition will be formulated and dosed in a 
fashion consistent with good medical practice, taking into account the clinical 
5 condition of the individual patient (especially the side effects of treatment 

' with TNFR-5, -6a or -6P polypeptide alone), the site of delivery of the 
TNFR polypeptide composition, the method of administration, the scheduling 
of administration, and other factors known to practitioners. The "effective 
amount" of TNFR polypeptide for purposes herein is thus determined by 

io such considerations. 

As a general proposition, the total pharmaceutically effective amount 
of TNFR polypeptide administered parenterally per dose will be in the range 
of about 1 ug/kg/day to 10 mg/kg/day of patient body weight, although, as 
noted above, this will be subject to therapeutic discretion. More preferably, 
1 5 this dose is at least 0.0 1 mg/kg/day, and most preferably for humans between 

about 0.0 1 and 1 mg/kg/day for the he imone. If given continuously, the TNFR 
polypeptide is typically administered at a dose rate of about 1 ug/kg/hour to 
about 50 jig 'kg/hour, either by 1-4 injections per day or by continuous 
subcutaneous infusions, for example, using a mini-pump. An intravenous bag 
20 solution may also be employed. The length of treatment needed to observe 

changes and the interval following treatment for responses to occur appears to 
vary depending on the desired effect. 

Pharmaceutical compositions containing the TNFR of the invention 
may be administered orally, rectally, parenterally, intracistemally, 
25 intravaginally, intraperitoneal^, topically (as by powders, ointments, drops or 

transdermal patch), bucally, or as an oral or nasal spray. By 
"pharmaceutically acceptable carrier" is meant a non-toxic solid, semisolid or 
liquid filler, diluent, encapsulating material or formulation auxiliary of any 
type. The term "parenteral" as used herein refers to modes of administration 



^ y . .... 

' V*" 




which include intravenous, intramuscular, intraperitoneal, intrasternal, 
subcutaneous and intraarticular injection and infusion. 

The TNFR polypeptide is also suitably administered by sustained- 
release systems. Suitable examples of sustained-release compositions include 
semi-permeable polymer matrices in the form of shaped articles, e.g., films, or 
mirocapsules. Sustained-release matrices include polylactides (U.S. Pat. No. 
3,773,919, EP 58,481), copolymers of L-glutamic acid and gamma-ethyl- L- 
glutamate (Sidman. U. et al., Biopolymers 22:547-556 (1983)), poly (2- 
hydroxyethyl methacrylate) (R. Langer et al., J. Biomed. Mater. Res. 75:167- 
277 (1981), and R. Langer, Chem. Tech. 72:98-105 (1982)), ethylene vinyl 
acetate (R. Langer et al. Id.) or poly-D- (-)-3-hydroxybutyric acid (EP 
133.988). Sustained-release TNFR polypeptide compositions also include 
liposomal ly entrapped TNFR polypeptides. Liposomes containing TNFR 
polypeptides are prepared by methods known per se: DE 3,218,121; Epstein 
ct al., Proc. Natl. Acad. Sci. (USA) 82:3688-3692 (1985); Hwang et al., Proc. 
Natl. Acad. Sci. (USA) 77:4030-4034 (1980); EP 52,322; EP 36,676; EP 
88,046; EP 143,949; EP 142,641 ; Japanese Pat. Appl. 83-118008; U.S. Pat. 
Nos. 4,485,045 and 4,544,545; and EP i 02,324. Ordinarily, the liposomes are 
of the small (about 200-800 Angstroms) unilamellar type in which the lipid 
content is greater than about 30 mol. percent cholesterol, the selected 
proportion being adjusted for the optimal TNFR polypeptide therapy. 

Fr' parenteral administration, in one embodiment, the TNFR 
polypeptide is formulated generally by mixing it at the desired degree of 
purity, in a unit dosage injectable form (solution, suspension, or emulsion), 
with a pharmaceutical^ acceptable carrier, i.e., one that is non-toxic to 
recipients at the dosages and concentrations employed and is compatible with 
other ingredients of the formulation. For example, the formulation preferably 
docs not include oxidizing agents and other compounds that are known to be 
deleterious to potypL-pti ics. 
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hydrophilic polymers such as polyvinylpyrrolidone; amino acids, such js 
glycine, glutamic acid, aspartic acid, or arginine; monosaccharides, 
disaccharides, and other carbohydrates including cellulose or its derivatives, 
glucose, manosc, or dcxtrins; chelating agents such as EDTA; sugar alcohols 
such as mannitol or sorbitol; counterions such as sodium; and/or nonionic 
surfactants such as polysorbates, poloxamers, or PEG. 

The TNFR polypeptide is typically formulated in such vehicles at a 
concentration of about 0.1 mg/ml to 100 mg/ml. preferably I -10 mg ml. at a pH 
of about 3 to 8. It will be understood that the use of certain or the foregoing 
cxcipicnts, carriers, or stabilizers will result in the formation of TNFR 

polypeptide salts. 

TNFR polypeptides to be used for therapeutic administration must be 
sterile. Sterility is readily accomplished by filtration through sterile filtration 
membranes (e.g., 0.2 micron membranes). Therapeutic TNFR polypeptide 
- - compositions generally arc placed into a container having a sterile access port. 
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Generally, the formulations are prepared by contacting the TNFR £r^^ 
polypeptide uniformly and intimately with liquid carriers or finely divided 
solid carriers or both. Then, if necessary, the product is shaped into the 
desired formulation. Preferably the carrier is a parenteral carrier, more r 
5 preferably a solution that is isotonic with the blood of the recipient. Examples 

of such carrier vehicles include water, saline. Ringer's solution, and dextrose 

>lution. Non-aqueous vehicles such as fixed oils and ethyl oleate are also 
useful herein, as well as liposomes. 

The carrier suitably contains minor amounts of additives such as 
10 substances that enhance isotonicity and chemical stability. Such materials are — 

non-toxic to recipients at the dosages and concentrations employed, and 
include buffers such as phosphate, citrate, succinate, acetic acid, and other 

organic acids or their salts; antioxidants such as ascorbic acid; low molecular ^ m m mm 

weight (less than about ten residues) polypeptides, e.g., poly arginine or ^ 
1 5 tripeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; Jslv* 
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for example, an intravenous solution bag or vial having a stopper pierceable by 
a hypodermic injection needle. 

TNFR polypeptides ordinarily will be stored in unit or multi-dose 
containers, for example, sealed ampoules or vials, as an aqueous solution or as 
a lyophihzed formulation for reconstirution. As an example or* a lyophihzed 
formulation. 10-ml vials are filled with 5 ml of sterile-filtered 1% (w v) 
aqueous TNFR polypeptide solution, and the resulting mixture is lyophihzed. 
The infusion solution is prepared by reconstituting the lyophihzed TNFR 
polypeptide using bacteriostatic Water-for-Injection. 

The invention also provides a pharmaceutical pack or kit comprising 
one or more containers filled with one or more of the ingredients of the 
pharmaceutical compositions of the invention. Associated with ;"ch 
ccntainer(s) can be a notice in the form prescribed by a governmental agency 
regulating the manufacture, use or sale of pharmaceuticals or biological 
products, which notice reflects approval by the agency of manufacture, use or 
sale for human administration. In addition, the polypeptides of the present 
invention may be employed in conjunction with other therapeutic compounds. 



♦ 



- V ■ 



:<> 



Chromosome Assays 

The nucleic acid molecules of the present invention are also valuable for 
chromosome identification. The sequence is specifically targeted to and can 
hybridize with a particular location on an individual human chromosome. 
Moreover, there is a current need for identifying particular sites on the 
chromosome. Few chromosome marking reagents based on actual sequence 
data (repeat polymorphism.) are presently available for marking chromosomal 
location. The mapping of DNAs to chromosomes according to the present 
invention is an important first step in correlating those sequences with genes 
associated with disease. 

In certain preferred embodiments in this regard, the cDNAs herein 
disclosed are used to clone genomic DNA of a TNFR protein gene. This can 
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be accomplished using a variety of well known techniques and libraries, which 
generally are available commercially. The genomic DN A then is used for in 
situ chromosome mapping using well known techniques for this purpose. 

In addition, in some cases, sequences can be mapped to chromosomes 
by preparing PCR primers (preferably 15-25 bp) from the cDNA. Computer 
analysis of the 3' untranslated region of the gene is used to rapidly select 
primers that do not span more than one exon in the genomic DNA, thus 
complicating the amplification process. These primers are then used for PCR 
screening of somatic cell hybrids containing individual human chromosomes. 
Fluorescence in situ hybridization ("FISH") of a cDNA clone to a metaphase 
chromosomal spread can be used to provide a precise chromosomal location in 
one step. This technique can be used with probes from the cDNA as short as 
50 or 60 bp. For a review of this technique, see Verma ct al. Human 
Chromosomes: A Manual Of Basic Techniques. Pcrgamon Press. New York 
(1988). 

Once a sequence has been mapped to a precise chromosomal location, 
the physical position of the sequence on the chromosome can be correlated 
with genetic map data. Such data are found, for example, in V. McKusick, 
Memielian Inheritance In Man. available on-line through Johns Hopkins 
University. Welch Medical Library. The relationship between genes and 
diseases that have been mapped to the same chromosomal region arc then 
identified through linkage analysis (coinhentance of physically adjacent genes). 

Next, it is necessary to determine the differences in the cDNA or 
genomic sequence between affected and unaffected individuals. If a mutation is 
observed in some or all of the affected individuals but not in any normal 
individuals, then the mutation is likely to be the causative agent of the disease. 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by 
way of illustration and are not intended as limiting. 
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Examples 

Example 1(a): Expression and Purification of "His-taf>Red« Extracellular 

form of TNFR-5 in E. coli 

The bacteria! expression vector pQE9 (pDIO) is used for bacterial 
expression in this example. (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth. 
CA, 9131 1). pQE9 encodes ampicillin antibiotic resistance ( M Ampr M ) and 
contains a bacterial origin of replication ("ori"). an IPTG inducible promoter, a 
ribosome binding site ("RBS"), six codons encoding histidine residues that 
allow affinity purification using nickel-nitrilo-tri-acetic acid ("Ni-NTA") 
affinity resin sold by QIAGEN, Inc., supra, and suitable single restriction 
enzyme cleavage sites. These elements are arranged such that an inserted 
DNA fragment encoding a polypeptide expresses that polypeptide with the 
six His residues (i.e., a "6 X His tag") covalently linked to the amino terminus 
of that polypeptide. 

The DNA sequence encoding the desired portion of the TNFR-5 
protein comprising the extracellular form of the TNFR-5 amino acid sequence 
is amplified from the deposited cDN A clone using PCR oligonucleotide 
primers which anneal to sequence encoding the amino terminal sequences of 
the desired portion of the TNFR-5 protein and to carboxy terminal sequences 
of the desired portion of the extracellular form of the TNFR-5 protein in the 
deposited cDNA. Additional nucleotides containing restriction sites to 
facilitate cloning in the P QE9 vector are added to the 5' and 3* primer 
sequences, respectively. 

For cloning the extracellular form of the TNFR-5 protein, the 5" primer 
has the sequence 5' CGC GGATCC ACCACTGCCCGGCAGGAG 3- (SEQ 
ID NO: 25) containing the underlined Bamlll restriction site followed by IK 
nucleotides of the amino terminal coding sequence of the extracellular TNFR-5 
sequence in SEQ ID NO:2. One of ordinary skill in the art would appreciate, 
' - of course, that the point in the protein coding sequence where the 5' primer 



begins and where the 3" primer ends may be varied to amplify a DNA segment 
encoding any desired portion of the complete TNFR-5 protein shorter or 
longer than the extracellular form of the protein. The 3 * primer has the 
sequence 5' GCGTCTAGACTAGTAATGAGAAGAGGCAGG 3' (SEQ ID 
NO:26) containing the undt-ilined Xbal restriction site followed by 18 
nucleotides complementary to the 3' end of cDNA encoding the extracellular 
domain of the TNFR-5 protein in Figure 1 . 

The amplified TNFR-5 DNA fragment and the vector pQE9 are 
digested with BamHI and Xbal and the digested DNAs are then ligated 
together. Insertion of the TNFR-5 DNA into the restricted pQE9 vector 
places the TNFR-5 protein coding region downstream from the IPTG- 
inducible promoter and in-frame with an initiating AUG and the six histidine 
codons. 

The ligation mixture is transformed into competent E. coli cells using 
standard procedures such as those described in Sambrook et a!.. Molecular 
Cloning: a Laboratory- Manual. 2nd Ed.; Cold Spring Harbor Laboratory 
Press. Cold Spring Harbor. NY (1989). E. coli strain Ml 5'rep4. containing 
multiple copies of the plasmid pREP4. which expresses the lac repressor and 
confers kanamycin resistance ("Kanr"). is used in carrying out the illustrative 
example described herein. This strain, which is only one of many that are 
suitable for expressing TNFR-5 protein, is available commercially from 
Q1AGEN. Inc.. supra. Transformants arc identified by their ability to grow 
on LB plates in the presence of ampicillin and kanamycin. Plasmid DNA is 
isolated from resistant colonics and the identity of the cloned DNA confirmed 
by restriction analysis. PCR and DNA sequencing. 

Clones containing the desired constructs arc grown overnight ("ON") 
in liquid culture in LB media supplemented with both ampicillin (J00 ug ml) 
and kanamycin (25 ug ml). The O N culture is used to inoculate a large 
- - culture, at a dilution of approximately 1 :25 to 1 .250. The cells arc grown to an 
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optical density at 600 nm ("OD600") of between 0.4 and 0.6. Isopropyl-(i-D- 
thiogalactopyranoside ("IPTG") is then added to a final concentration of 1 
mM to induce transcription from the lac repressor sensitive promoter, by 
inactivating the lad repressor. Cells subsequently are incubated further for 3 
to 4 hours. Cells then are harvested by cenrrifugation. 

The cells are then stirred for 3-4 hours at 4° C in 6M guanidine-HCI, 
pH 8. The cell debris is removed by centrifugation, and the supernatant 
containing the TNFR-5 is loaded onto a nickel-nitrilo-tri-acetic acid ("Ni- 
NTA") affinity resin column (available from QIAGEN, Inc., supra). Proteins 
with a 6 x His tag bind to the Ni-NTA resin with high affinity and can be 
purified in a simple one-step procedure (for details see: The QI AexpressionisU 
1995, QIAGEN, Inc., supra). Briefly the supernatant is loaded onto the 
column in 6 M guanidine-HCI, pH 8, the column is first washed with 10 
volumes of 6 M guanidine-HCI, pH 8, then washed with 10 volumes of 6 M 
guanidine-HCI pH 6, and finally the TNFR-5 is eluted with 6 M guanidine- 
HCI, pH 5. 

The purified protein is then renatured by dialyzing it against 
phosphate-buffered saline (PBS) or 50 mM Na-acctate, pH 6 buffer plus 200 
mM NaCl. Alternatively, the protein can be successfully refolded while 
immobilized on the Ni-NTA column. The recommended conditions are as 
follows: rcnature using a linear 6M-IM urea gradient in 500 mM NaCl, 20% 
glycerol, 20 mM Tris/HCI pH 7.4, containing protease inhibitors. The 
rcnaturation should be performed over a period of 1 .5 hours or more. After 
rcnaturation the proteins can be eluted by the addition of 250 mM immidazote. 
Immidazolc is removed by a final dialyzing step against PBS or 50 mM 
sodium acetate pH 6 buffer plus 200 mM NaCl. The purified protein is stored 
at4°Corfrozcnat-K0°C 




tf-~: 



J. 



ft « 



■6S- 



10 



15 



20 



25 



Example 1(b): Expression and Purification of TNFR-6a and -6/J in E. 
coli 

The bacterial expression vector pQE60 is used for bacterial expression 
in this example (QIAGEN, Inc., 9259 Eton Avenue. Chatsworth, CA, 9131 1). 
pQE60 encodes ampicillin antibiotic resistance ("Ampr") and contains a 
bacterial origin of replication ("cri"), an IPTG inducible promoter, a ribosome 
binding site ("RBS"), six codons encoding histidine residues that allow affinity 
purification using nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin sold 
by QIAGEN. Inc.. supra, and suitable single restriction enzyme cleavage sites. 
These elements are arranged such that a DN A fragment encoding a 
polypeptide may be inserted in such as way as to produce that polypeptide 
with the six His residues (i.e.. a "6 X His tag") covalently linked to the 
carboxyl terminus of that polypeptide. However, in this example, the 
polypeptide coding sequence is inserted such that translation of the six His 
codons is prevented and. therefore, the polypeptide is produced with no 6 X 
His tag. 

The DNA sequences encoding the desired portions of TNFR-6ot and - 
6p proteins comprising the mature forms of the TNFR-6a and -6p amino acid 
sequences arc amplified from the deposited cDNA clones using PCR 
ohgonuclcotide primers which anneal to the am.no terminal sequences of the 
desired portions of the TNFR-5. -6a or -6(1 proteins and to sequences in the 
deposited constructs 3" to the cDNA coding sequence. Additional nucleotides 
containing restriction sites to facilitate cloning in the pQE60 vector are added 
to the 5" and 3* sequences, respectively. 

For cloning the mature form of the TNFR-6ct protein, the 5' primer 
has the sequence 5" CGC CCATGG CAGAAACACCCACCTAC 3' (SEQ ID 
NO:27) containing the underlined Ncol restriction site. One of ordinary skill 
in the art would appreciate, of course, that the point in t!ic protein coding 
' " sequence where the 5' primer begins may be varied to amplify a desired 
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portion of the complete protein shorter or longer than the mature form. The 3' 
primer has the sequence 5* CGC AACCTT CTCTTTCAGTGCAACTG 3' 
(SEQ ID NO:28) containing the underlined Hindlll restriction site. For cloning 
the mature form of the TNFR-6P protein, the 5' primer has the sequence of 
SEQ ID NO:2^ above, and the 3* primer has the sequence 5' 
CGC AAGCTT CTCCTCAGCTCCTGCAGTG 3' (SEQ ID NO:29) 
containing the underlined Hindlll restriction site. 

The amplified TNFR-6a and -6p DNA fragments and the vector 
pQE60 are digested with Ncol and Hindlll and the digested DNAs are then 
ligated together. Insertion of the TNFR-6ot and -6p DNA into the restricted 
pQE60 vector places the TNFR-6ot and -6p protein coding region including its 
associated stop codon downstream from the IPTG-inducible promoter and in- 
frame with an initiating AUG. The associated stop codon prevents translation 
of the six histidine codons downstream of the insertion point. 

The ligation mixture is transformed into competent £. coli cells using 
standard procedures such as those described in Sambrook et al.. Molecular 
Cloning: a Laboratory Manual. 2nd Ed. ; Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY (1989). E. colt strain M15/rep4, containing 
multiple copies of the plasmid pREP4, which expresses the lac repressor and 
confers kanamycin resistance ("Kanr"), is used in carrying out the illustrative 
example described herein. This strain, w hich is only one of many that are 
suitable for expressing TNFR-6a or -6p protein, is available commercially 
from QIAGEN, Inc.. supra. Transformants arc identified by their ability to 
grow on LB plates in the presence of ampicillin and kanamycin. Plasmid DNA 
is isolated from resistant colonics and the identity of the cloned DNA 
confirmed by restriction analysis. PCR and DNA sequencing. 

Clones containing the desired constructs arc grown overnight f'O N") 
in liquid culture in LB media supplemented with both ampicillin ( 100 ug ml) 
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and kanamycin (25 pg/ml). The O/N culture is used to inoculate a large 
culture, at a dilution of approximately 1 :25 to 1 :250. The cells are grown to an 
optical density at 600 nm ("OD600") of between 0.4 and 0.6. isopropyl-b-D- 
thiogalactopyranoside ("IPTG") is then added to a final concentration of 1 
mM to induce transcription from the lac repressor sensitive promoter, by 
inactivating the lacl repressor. Cells subsequently are incubated further for 3 
to 4 hours. Cells then are harvested by centrifugation. 

To purify the TNFR-6a and -60 polypeptide, the cells are then stirred 
for 3-4 hours at 4° C in 6M guanidine-HCl, pH 8. The cell debris is removed 
by centrifiigation, and the supernatant containing the TNFR-6a and -6|J is 
dialyzed against 50 mM Na-acetate bufTer pH 6, supplemented with 200 mM 
Nad. Alternatively, the protein can be successfully refolded by dialyzing it 
against 500 mM NaCl, 20% glycerol. 25 mM Tris/HCl pH 7.4, containing 
protease inhibitors. After renaruration the protein can be purified by ion 
exchange, hydrophobic interaction and size exclusion chromatography. 
Alternatively, an atTinity chromatography step such as an antibody column 
can be used to obtain pure TNFR-6a and -6(J protein. The purified protein is 
stored at 4'X or frozen at -80° C. 

The following alternative method may be used to punfy TNFR-5, -*>ct 
or -6P expressed in £ coii when it is present in the form of inclusion bodies. 
Unless otherw ise specified, all of the following steps arc conducted at 4-l0 c C. 

Upon completion of the production phase of the £. coii fermentation, 
the cell culture is cooled to 4-tP°C and the cells are harvested by continuous 
centrifugation at 15,000 rpm (llcracus Scpatcch). On the basis of the expected 
yield of protein per unit weight of cell paste and the amount of purified 
protein required, an appropriate amount of cell paste, by weight, is suspended 
in a buffer solution containing 100 mM Tns, 50 mM fcDTA, pH 7.4. The 
cells are dispersed to a homogeneous suspension using a high shear mixer. 
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The cells ware then lysed by passing the solution through a 
micronuidizer (Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 
psi. The homogenate is then mixed with NaCl solution to a final concentration 
of 0.5 M NaCl, followed by centrifugation at 7000 xg for 15 min. The 
resultant pellet is washed again using 0.5M NaCl, 100 mM Tris, 50 mM 
EDTA, pH 7.4. 

The resulting washed inclusion bodies are solubilized with 1.5 M 
guanidine hydrochloride (GuHCI) for hours. After 7000 xg centrifugation 
for 15 min., the pellet is discarded and the TNFR-5, -6a or -6p 
polypeptide-containing supernatant is incubated at 4°C overnight to allow 

further GuHCI extraction. 

Following high speed centrifugation (30.000 xg) to remove insoluble 
particles, the GuHCI solubilized protein is refolded by quickly mixing the 
GuHCI extract with 20 volumes of buffer containing 50 mM sodium, pH 4.5, 
150 mM NaCl, 2 mM EDTA by vigorous stirring. The refolded diluted 
protein solution is kept at 4°C without mixing for 12 hours prior to further 

purification steps. 

To clarify the refolded TNF receptor polypeptide solution, a 
previously prepared tangential filtration unit equipped with 0.16 \xm 
membrane filter with appropriate surface area (e.g., Filtron), equilibrated with 
40 mM sodium acetate, pH 6.0 is employed. The filtered sample is loaded 
onto a cation exchange resin (e.g., Poros I IS-50, Perscptive Biosystems). The 
column is washed with 40 mM sodium acetate, pH 6.0 and cluted with 250 
mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a 
stepwise manner. The absorbance at 2K0 mm of the effluent is continuously 
monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the TNF receptor polypeptide are then pooled 
and mixed with 4 volumes of water. The diluted sample is then loaded onto a 
* previously prepared set of tandem columns of strong anion (Poros HQ-50, 



Perseptivc Biosystems) and weak anion (Poros CM-20, Pcrscptivc 
Biosystems) exchange resins. The columns are equilibrated with 40 mM 
sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium- 
acetate, pH 6.0. 200 mM NaCl. The CM-20 column is then eluted using a 10 
column volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium 
acetate, pH 6.0 to 1.0 M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are 
collected under constant A >0 monitoring of the effluent. Fractions containing 
the TNFR-5, -6a or -6(J polypeptide (determined, for instance, by 16% SDS- 

PAGE) arc then pooled. 

The resultant TNF receptor polypeptide exhibits greater than 95% 
punty after the above refolding and purification steps. No major contaminant 
bands are observed from Commassie blue stained 16% SDS-PAGE gel when 5 
ug of purified protein is loaded. The purified protein is also tested for 
endotoxin/LPS contamination, and typically the LPS content is less than 0.1 
ng'ml according to LAL assays. 

Example 2: Cloning and Expression ofTSFR-S. -6a and -6f$ proteins in a 
Baculovirus Expression System 

In this illustrative example, the plasmid shuttle vector pA2 is used to 
insert the cloned DNA encoding complete protein, including its naturally 
associated secretory signal (leader) sequence, into a baculovirus to express the 
mature TNFR-5, -6a or -6[i protein, using standard methods as described in 
Summers et ai. A Manual of Methods for Baculovirus Vectors and Insect Cell 
Culture Procedures. Texas Agricultural Experimental Station Bulletin No. 
I 555 (I^X7» This expression vector contains the strong polyhedrin promoter 
of the Autozrapha cahjornua nuclear polyhedrosis virus (AcMNPV) followed 
by convenient restriction sites such as Kami II. Xba I and Asp71S. The 
polyadenylation site of the simian virus 40 <"SV40") <* used for efficient 
polyadcnylation. For easy selection of recombinant virus, the plasmid 
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Contains the beta-galactosidasc gene from £. coti under control of a weak 
Drosophila promoter in the same orientation, followed bv the polyadenylation 
signal of the potyhedrin gene. The inserted genes are flanked on both sides by 
viral sequences for cell-mediated homologous recombination with w ild-type 
viral DNA to generate a viable virus that express the cloned polynucleotide. 

Many other baculovirus vectors could be used in place of the vector 
above, such as P Ac373. pVL941 and pAcIMU as one skilled in the art would 
readily appreciate, as long as the construct provides appropriately located 
signals for transcription, translation, secretion and the like, including a signal 
peptide and an in-frame AUG as required. Such vectors are described, for 
instance, in Luckow ct al.. Virology 1 70:3 1-39 ( 1989). 

The cDNA sequence encoding the full length TNFR-5, -6a or -6(i 
protein in a depos.ted clone, including the AUG initiation codon and the 
naturally associated leader sequence shown in SEQ ID NO:2. 4, or 6 is 
amplified using PCR oligonucleotide pnmcrs corresponding to the 5* and 3' 
sequences of the gene. The 5' primer for TNFR-5 has the sequence 
5 - CGCTCTAGACCGCCATCATGGCCCGGArCCCCAAG 3' (SEQ ID 
NO:30) containing the underlined Xbal restriction enzyme site. The 5* primer 
for TNFR-6a and -f»P has the sequence 

5'CGC GGATCC GCCATCATGAGGGCGTGGAGGGGCCAG 3' (SEQ 
ID NO:3 1 ) containing the underlined Rami II restriction enzyme site. All of 
the previously dcscrihcdprimcrs encode an efficient signal for initiation of 
translation in eukaryotic cells, as described by Kozak. M.. 7. Mot Biot 
/Vrt:947-950 ( l*>K7). The 3* primer for TNFR-5 has the sequence 
5- GCG TCTAGA CTAGTAATGAGAAGAGGCAGG V (SEQ ID NO:32> 
containing the underlined Xbal restriction site. The .V primer for TNFR-6Q 
has the sequence 5' CCiC CGTACC CTCTTTC AGTGCAAGTG 3* (SEQ ID 
NO:33) containing the underlined Asp7lK restriction site. The 3* primer for 
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TNFR-6P has the sequence 

5' CGCGGTACCCTCCTCAGCTCCTGCAGTG 3' (SEQ ID NO:34) 
containing the underlined Asp718 restriction site. 

The amplified fragment is isolated from a 1% agarose gel using a 
commercially avaitable kit ("Geneclean," BIO 101 Inc., La Jolla. Ca.). The 
fragment then is digested with the appropriate restriction enzyme for each of 
the primers used, as specified above, and again is purified on a 1% agarose gel. 

The plasmid is digested with the same restriction enzymes and 
optionally, can be dephosphorylated using calf intestinal phosphatase, using 
routine procedures known in the art. The DNA is then isolated from a 1% 
agarose gel using a commercially available kit ("Geneclean" BIO 10 1 Inc.. La 
Jolla. Ca.). 

The fragment and dephosphorylated plasmid are ligated together with 
T4 DNA ligase. E. coli HBI01 or other suitable E. coli hosts such as XL-1 
Blue (Statagene Cloning Systems. La Jolla, CA) cells are transformed with the 
ligation mixture and spread on culture plates. Bacteria are identified that 
contain the plasmid with the human TNF receptor gene by digesting DNA 
from individual colonies using the enzymes used immediately above and then 
analyzing the digestion product by gel electrophoresis. The sequence of the 
c'.oncd fragment is confirmed by DNA sequencing. This plasmid is designated 
herein P A2-TNFR-5. pA2-TNFR-6a or pA2TNFR-6p (collectively pA2- 
TNFR). 

Five ug or the plasmid pA2-TNFR is co-transfected with 1 .0 ug of a 
commercially available linearized baculoviru* DNA ("BaculoGold™ 
baculovirus DNA". Phnnningen. San Diego. CA). using the lipofection method 
described by Feigner ct al.. Pn.c. Nail. Acad. Sci. USA *4: 7413-7417 (19R7). 
One ug of BaculoGold™ virus DNA and 5 ug of the plasmid pA2-TNFR arc 
mixed in a sterile well of a microliter plate containing 50 ul of serum-free 
• Grace's medium (Life Technologies Inc.. Gaithersburg. MD). Afterward*. 10 
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ul Lipofectin plus 90 ul Grace s medium are added, mixed and incubated for 15 
minutes at room temperature. Then the transfection mixture is added drop- 
wise to Sf9 insect cells (ATCC CRL 1711) seeded in a 35 mm tissue culture 
plate with 1 ml Grace's medium without serum. The plate is then incubated 
for 5 hours at 27° C. The transfection solution is then removed from the plate 
and 1 ml of Grace's insect medium supplemented with 10% fetal calf serum is 
added. Cultivation is then continued at 27° C for four days. 

After four days the supernatant is collected and a plaque assay is 
performed, as described by Summers and Smith, supra. An agarose gel with 
"Blue Gal" (Life Technologies Inc., Gaithersburg) is used to allow easy 
identification and isolation of gal-expressing clones, which produce blue- 
stained plaques. (A detailed description of a "plaque assay" of this type can 
also be found in the user's guide for insect cell culture and baculovirology 
distributed by Life Technologies Inc.. Gaithersburg, page 9-10). After 
appropriate incubation, blue stained plaques are picked with the tip of a 
micropipettor (e.g., Eppendorf). The agar containing the recombinant viruses 
is then resuspended in a microcentrifuge tube containing 200 ul of Grace's 
medium and the suspension containing the recombinant baculovin* is used to 
infect Sf9 cells seeded in 35 mm dishes. Four days later the supematants of 
these culture dishes are harvested and then they arc stored at 4° C. 

To verify the expression of the TNF receptor gene Sf9 cells are grown 
in Grace's medium supplemented with I0°i heat-inactivated FBS. The cells 
are infected with the recombinant baculovirus at a multiplicity of infection 
("MOI") of about 2. If radiolabeled proteins are desired. 6 hours later the 
medium is removed and is replaced with SF900 II medium minus methionine 
and cysteine (available from Life Technologies Inc., Rockville. MD). After 42 
hours, 5 nCi of ' 'S-methionine and 5 uCi "S-cysteine (available from 
Amersham) art added. The cells are funher incubated for 16 hours and then 
• arc harvested by ccntrifugation. The proteins in the supernatant as well as the 
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intracellular proteins are analyzed by SDS-PAGE followed by 
autoradiography (if radiolabeled). 

Microsequencing of the amino acid sequence of the amino terminus of 
purified protein may be used to determine the amino terminal sequence of the 
mature form of the TNF receptor protein. 

Example 3: Cloning and Expression ofTNFR-5. -6a and -6f3 in 

Mammalian Cells 

A typical mammalian expression vector contains the promoter element, 
which mediates the initiation of transcription of mRNA, the protein coding 
sequence, and signals required for the termination of transcription and 
polyadenylation of the transcript. Additional elements include enhancers. 
Kozak sequences and intervening sequences flanked by donor and acceptor 
sites for RNA splicing. Highly efficient transcription can be achieved with the 
early and late promoters from SV40, the long terminal repeats (LTRs) from 
Retroviruses, e.g.. RSV. HTLVI. HI VI and the early promoter of the 
cytomegalovirus (CMV). However, cellular elements can also be used (e.g., 
the human actin promoter). Suitable expression vectors for use in practicing 
the present invention include, for example, vectors such as pSVL and pMSG 
(Pharmacia. Uppsala. Sweden), pRSVcat (ATCC 37152), pSV2dr.fr (ATCC 
37146) and pBC12Ml (ATCC 67109). Mammalian host cells that touldbe 
used include, human Hela. 293. H9 and Jurkat cells, mouse NIH3T3 and C127 
cells. Cos 1, Cos 7 and CV1. quail QC1-3 cells, mouse L cells and Chinese 
hamster ovary (CHO) cells. 

Alternatively, the gene can be expressed in stable cell lines that contain 
the gene integrated into a chromosome. The co-transfection with a selectable 
marker such as dhfr. gpt. neomycin, hygromycm allows the identification and 
isolation of the transfected cells. 



The transfected gene can also be amplified to express large amounts of y -J 



the encoded protein. The DHFR (dihydrofolate reductase) marker is useful to ' $ 

develop cell lines that carry several hundred or even several thousand copies of 
the gene of interest. Another useful selection marker is the enzyme glutamine 
synthase (GS) (Murphy et aL Biochem J. 227:277-279 (1991 ); Bebbington et 
al.. Bio/Technology- 70:169-175 (1992)). Using these markers, the mammalian 
cells are grown in selective medium and the cells with the highest resistance are 
selected. These cell lines contain the amplified gene(s) integrated into a 
chromosome. Chinese hamster ovary (CHO) and NSO cells are often used for 
the production of proteins. 

The expression vectors pC 1 and pC4 contain the strong promoter 
(LTR) of the Rous Sarcoma Virus (Cullen et aL Molecular and Cellular 
Biology . 438-447 (March, 1985)) plus a fragment of the CMV-enhancer 
{Boshart et aL Cell 47:521-530 (1985)). Multiple cloning sites, e.g.. with the 
restriction enzyme cleavage sites BamHL Xbal and Asp718, facilitate the 
cloning of the gene of interest. The vectors contain in addition the 3' intron. 
the polyadenylation and termination signal of the rat preproinsulin gene. 

Example 3(a): Cloning and Expression in COS Cells 

The expression plasmid. pTNFR-5-HA, is made by cloning a portion 
of the cDNA encoding the mature form of the TNF receptor protein into the 
expression vector pcDNAl Amp or pcDN Alii (which can be obtained from 
Invitrogen, Inc.). 

The expression vector pcDNAl amp contains: ( I ) an £. coli origin of 
replication effective for propagation in E. colt and other prokaryotic cells; (2) 
an ampicillin resistance gene for selection of plasmid-containing prokaryotic 
cells; (3) an SY40 ongin of replication for propagation in eukaryotic cells. (4) a 
CMV promoter, a polylinker. an SV40 intron; (5) several codons encoding a 
hemagglutinin fragment (i.e., an "MA" tag to facilitate purification) followed by 
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a termination codon and polyadenylation signal arranged so that a cDNA can 

be conveniently placed under expression control of the CMV promoter and :">« 
operably linked to the SV40 intron and the polyadenylation signal by means of 



restriction sites in the polylinker. The HA tag corresponds to an epitope 
derived from the influenza hemagglutinin protein described by Wilson et al.. 
Cell 37: 767 (1984). The fusion of the HA tag to the target protein allows 
easy detection and recovery of the recombinant protein with an antibody that 
recognizes the HA epitope. pcDNAIII contains, in addition, the selectable 
neomycin marker. 

A DNA fragment encoding the complete TNF receptor polypeptide is 
cloned into the polylinker region of the vector so that recombinant protein 
expression is directed by the CMV promoter. The plasmid construction 
strategy is as follows. The TNF receptor cDNA of a deposited clone is 
amplified using primers that contain convenient restriction sites, much as 
described above for construction of vectors for expression of a TNF receptor 
in E. coli. Suitable primers include the following, which are used in this 
example. The 5' primer for TNFR-5, containing the underlined EcoRI site, has 
the following sequence: 

5' CGC GAATTC CGCCATCATGGCCCGGATCCCCAAG 3' (SEQ ID 
NO:35). The 3' primer, containing the underlined Xbal site, has the following 
sequence: 5' GCC TCTAG A GTAATGAGAAGAGGCAGG 3* (SEQ ID 
NO:36). 

The PCR amplified DNA fragment and the vector, pcDNAI/Amp, are 
digested with Xbal and EcoR! and then ligated. The ligation mixture is 
transformed into E. coli strain SURE (available from Stratagene Cloning 
Systems, 1 1099 North Torrcy Pines Road, La Jolla, CA 92037), and the 
transformed culture is plated on ampicillin media plates which then are 
incubated to allow growth of ampicillin resistant colonies. Plasmid DNA is 
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isolated from resistant colonies and examined by restriction analysis or other 
means for the presence of the fragment encoding the TNFR-5 polypeptide 

For expression of recombinant TNFR-5, COS cells are transfected with 
an expression vector, as described above, using DEAE-DEXTRAN, as 
described, for instance, in Sambrook et al. t Molecular Cloning: a Laboratory 
Manual, Cold Spring Laboratory Press, Cold Spring Harbor, New York 
(1989). Cells are incubated under conditions for expression of TNFR-5 by the 
vector. 

Expression of the pTNFR-5-HA fusion protein is detected by 
radiolabeling and immunoprecipitation, using methods described in, for 
example Harlow et aU Antibodies: A Laboratory Manual, 2nd Ed.; Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, New York (1988). To this end, 
two days after transfection, the cells are labeled by incubation in media 
containing 35 S-cysteine for 8 hours. The cells and the media are collected, and 
the cells are washed and the lysed with detergent-containing RJPA buffer: 1 50 
mM NaCl, 1% NP-40, 0.1% SDS, 1% NP-40, 0.5% DOC, 50 mM TRIS, pH 
7.5, as described by Wilson et al. cited above. Proteins are precipitated from 
the cell lysate and from the culture media using an HA-specific monoclonal 
antibody. The precipitated proteins then are analyzed by SDS-PAGE and 
autoradiography. An expression product of the expected size is seen in the 
cell lysate, which is not seen in negative controls. 

Example 3(b): Cloning and Expression in CHO Cells 

The vector pC4 is used for the expression of TNFR-5, -6a and -6[J 
polypeptides. Plasmid pC4 is a derivative of the plasmid pSV2-dhfr ( ATCC 
Accession No. 37146). The plasmid contains the mouse DHFR gene under 
control of the SV40 early promoter. Chinese hamster ovary- or other cells 
lacking dihydrofolatc activity that are transfected with these plasmids can be 
selected by growing the cells in a selective medium (alpha minus MEM, Life 
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Technologies) supplemented with the chemotherapeutic agent methotrexate. 
The amplification of the DHFR genes in cells resistant to methotrexate (MTX) 
has been well documented (see, e.g., Alt, F. W., Kellems, R. M., Bertino, J. R., 
and Schimke, R. T., 1978. J. Biol. Chem. 255:1357-1370, Hamlin, J. L. and 

Ma, C. 1990, Biochem. et Biophys. Acta. 7097:107-143, Page, M. J. and W>^' 
Sydenham, M. A. 1 99 1 , Biotechnology 9:64-68). Cells grown in increasing 

concentrations of MTX develop resistance to the drug by overproducing the '['-'^ 
target enzyme, DHFR, as a result of amplification of the DHFR gene. If a ; : \ 

second gene is linked to the DHFR gene, it is usually co-amplified and over- 

expressed. It is known in the art that this approach may be used to develop ^ " " 

cell lines carrying more than 1 ,000 copies of the amplified gene(s). 
Subsequently, when the methotrexate is withdrawn, cell lines are obtained 
which contain the amplified gene integrated into one or more chromosome(s) of 
the host cell. 

Plasmid pC4 contains for expressing the gene of interest the strong ^ 
promoter of the long terminal repeat (LTR) of the Rouse Sarcoma Virus 
(Cullen, et al.. Molecular and Cellular Biology. March 1985:438-447) plus a 
fragment isolated from the enhancer of the immediate early gene of human 
cytomegalovirus (CMV) (Boshart et al.. Cell 41:52 1-530 (1985)). 
Downstream of the promoter are the following single restriction enzyme 
cleavage sites that allow the integration of the genes: BamHl, Xba I, and 
Asp718. Behind these cloning sites the plasmid contains the 3' intron and 
polyadenylation site of the rat preproinsulin gene. Other high efficiency 
promoters can also be used for the expression, e.g., the human B-actin 
promoter, the SV40 early or late promoters or the long terminal repeats from 
other retroviruses, e.g., HIV and HTLV1. Clontech's Tet-Off and Tet-On 
gene expression systems and similar systems can be used to express the TNF 
receptor polypeptide in a regulated way in mammalian cells (Gossen. M., & 
Bujard, H. 1992, Prae. Natl. Acad. Set. USA 9:5547-555 1). For the 
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polyadenylation of the mRNA other signals, e.g., from the human growth 
hormone or globin genes can be used as well. Stable cell lines carrying a gene of 
interest integrated into the chromosomes can also be selected upon co- 
transfection with a selectable marker such as gpt. G4 1 8 or hygromycin. It is 
advantageous to use more than one selectable marker in the beginning, e.g.. 
G4 1 8 plus methotrexate. 

The plasmid pC4 is digested with the restriction enrymes appropriate 
for the specific primers used to amplify the TNF receptor of choice as 
outlined below and then dephosphorylated using calf intestinal phosphates by 
procedures known in the art. The vector is then isolated from a 1% agarose 
geL 

The DNA sequence encoding the TNF receptor polypeptide is 
amplified using PCR oligonucleotide primers corresponding to the 5' and 3* 
sequences of the^ej .portion of the gene. The 5' primer for TNFR-5 
containing the underlined Xbal site, has the following sequence: 
5' CGC TCTAGAC CGCCATCATGGCCCGGATCCCCAAG 3' (SEQ ID 
NO:37). The 5' primer for TNFR-6a and -6(5 containing the underlined 
BamHI site, has the following sequence: 

5' CGCGGAJCCGCCATCATGAGGGCGTGGAGGGGCCAG 3' (SEQ 

ID NO:3 1). The 3' primer for TNFR-5. containing the underlined Xbal site, 

" has the following sequence: 

5- GCGTCTAGACTAGTAATGAGAAGAGGCAGG 3* (SEQ ID NO:38). 

The 3' primer for TNFR-60 has the sequence 
5' CGC GGTACC CTCTTTCAGTGCAAGTG 3' (SEQ ID NO:33) 
containing the underlined Asp718 restriction site. The 3' primer for TNFR-6p 
has the sequence 5' CGC GGTACCC TCCTCAGCTCCTGCAGTG 3* (SEQ 
ID NO:34) containing the underlined Asp7l8 restriction site. 
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The amplified fragment is digested with the endonucleases which will 
cut at the engineered restriction site(s) and then purified again on a 1% agarose 
gel. The isolated fragment and the dephosphoryiated vector are then ligated 
with T4 DNA ligase. E. coli HB101 or XL-1 Blue cells are then transformed 
and bacteria are identified that contain the fragment inserted into plasmid pC4 
using, for instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene are used for 
transfection. Five ng of the expression plasmid pC4 is cotransfected with 0.5 
Hg of the plasmid pSVneo using lipofectin (Feigner et al. t supra). The plasmid 
pSV2-neo contains a dominant selectable marker, the neo gene from Tn5 
encoding an enzyme that confers resistance to a group of antibiotics including 
G418. The cells are seeded in alpha minus MEM supplemented with 1 mg/ml 
G41 8. After 2 days, the cells are trypsinized and seeded in hybridoma cloning 
plates (Greiner, Germany) in alpha minus MEM supplemented with 10, 25, or 
50 ng/ml of metothrexate plus 1 mg/ml G418. After about 10-14 days single 
clones are trypsinized and then seeded in 6-weil petri dishes or 10 ml flasks 
using different concentrations of methotrexate (50 nM, 100 nM, 200 nM, 400 
nM, 800 nM). Clones growing at the highest concentrations of methotrexate 
are then transferred to new 6-well plates containing even higher concentrations 
of methotrexate (1 jiM. 2 uM, 5 *iM, 10 mM, 20 mM). The same procedure 
is repeated until clones are obtained which grow at a concentration of 100 - 
200 nM. Expression of the desired gene product is analyzed, for instance, by 
SDS-PAGE and Western blot or by reversed phase HPLC analysis. 

Example 4; Tissue distribution of TSF receptor mRNA expression 

Northern blot analysis is carried out to examine TNFR-5, -6a or -6p 
gene expression in human tissues, using methods described by, among others, 
Sambrook et a/., cited above. A cDNA probe containing the entire nucleotide 
sequence of a TNF receptor protein (SEQ ID NO:l, 3 or 5) is labeled with * : P 
using the r«/iprimc 1M DNA labeling system (Amersham Life Science), 
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Icording to manufacturer's instructions. After labeling, the probe is purified 
ing a CHROMA SPIN- 100™ column (Clontech Laboratories, Inc.), 
according to manufacturer's protocol number PT1 200-1 . The purified labeled 
probe is then used to examine various human tissues for TNF receptor mRN A. 

Multiple Tissue Northern (MTN) blots containing various human 
tissues (H) or human immune system tissues (IM) are obtained from Clontech 
and are examined with the labeled probe using ExpressHyb™ hybridization 
solution (Clontech) according to manufacturer's protocol number PT1 190-1. 
Following hybridization and washing, the blots are mounted and exposed to 
film at -70° C overnight, and films developed according to standard 
procedures. 

It will be clear that the invention may be practiced otherwise than as 
particularly described in the foregoing description and examples. Numerous 
modifications and variations of the present invention are possible in light of 
the above teachings and, therefore, are within the scope of the appended 
claims. 

The entire disclosure of all publications (including patents, patent 
applications, journal articles, laboratory manuals, books, or other documents) 
cited herein are hereby incorporated by reference. 
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(1) GENERAL INFORMATION: 



(i) APPLICANT: WEI , YING-FEI 
NI, JIAN 
FAN, PENG 
EBNER, REINHARD 
YU. GUO-LIANG 
GENTZ, REINER 
RUBEN, STEVEN 

(ii) TITLE OF INVENTION: TUMOR NECROSIS FACTOR RECEPTORS 5, 6 

ALPHA AND 6 BETA 

<iii> NUMBER OF SEQUENCES: 38 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: HUMAN GENOME SCIENCES, INC. 

(B) STREET: 9410 KEY WEST AVENUE 

(C) CITY: ROCKVILLE 

(D) STATE: MD 

(E) COUNTRY: US 

(F) ZIP: 20850 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
{B) FILING DATE: 
(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: BROOKES , ANDERS A 

(B) REGISTRATION NUMBEl : 36,3*73 

(C) REFERENCE/ DOCKET NUMBER: PF341 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (301) 309-8504 

(B) TELEFAX: (301) 309*8512 



12) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1392 base pairs 

(B) TYPE: nucleic acid 
IC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(jx) i"-* - PS: 

(A) NAME/KEY: CDS 

(B) LOCATION : 183.-959 



<xi! SEQUENCE DESCRIPTtOH : SEQ ID NO:l: 

CCTCTCCACG OCACGAACT CAGCCAACGA TTTCTGATAG ATTTTTCGGA GTTTGACCAG 

AGATGCAACG Gul^AGGAG CGCTTCCTAC CGTTAGGGAA CTCTGGGGAC AGAGCGCCCC 

GGCCGCCTGA TGGCC 3 ".GGC AGGGTGCGAC CCAGGACCCA GGACGCCGTC GGGAACCATA 

CC ATG GCC CGG ATC CCC AAG ACC CTA AAG TTC GTC GTC GTC ATC GTC 
Set Ala Arg lie Pro Lys Thr Leu Lys Phe Val Val Val He val 
! 5 10 15 

GCG GTC CTG CTG CC, 3TC CTA GCT TAC TCT GCC ACC ACT GCC CGG CAG 
S 55 HI Su Pre val Leu Ala Tyr Ser Ala Thr Thr Ala Arg Gin 

20 25 

GAG GAA GTT CCC CAG TAG ACA GTG GCC CCA CAG CAA CAG AGG CAC AGC 
S! Su VaT Pro Gin Gin Thr val Ala Pro Gin Gin Gin Arg His Ser 
35 40 45 

TTC AAG GGG GAG GAG TGT CCA GCA GGA TCT CAT AGA TCA GAA CAT ACT 
IS £s Gly Glu Glu. Cys Pro Ala Gly Ser His Arg Ser rlu Hxs Thr 
50 55 

PGA GCC TGT AAC CCG TGC ACA GAG GGT GTG GAT TAC ACC AAC GCT TCC 
Ty Ala Cyl £n Pro Cys Thr Glu Gly Val Asp Tyr Thr Asn Al> Ser 
65 ™ 75 

AAC AAT GAA CCT TCT TGC TTC CCA TGT ACA GTT TGT AAA TCA GAT CAA 
£n £n Glu Pro Ser Cys Phe Pro Cys Thr Val Cys Lys Ser Asp 

60 65 90 

IAA CAT AAA ACT TCC TGC ACC ATG ACC AGA CAC ACA GTG TGT CAG TGT 
SI Hil tyt Ser Ser Cys Thr Met Thr Arg Asp Thr Val Cys Gin Cyr 
100 105 

riVA GGC ACC TTC CGG AAT GAA AAC TCC CCA GAG ATG TGC CGG AAG 
£ SI S Thr £e Arg Asn Glu Asn Ser Pro Glu Met Cys Arg T-y. 
115 120 12J 

TGT AGC AGG TGC CCT AGT GGG GAA GTC CAA GTC ACT AAT TOT A£ T£ 
Cys Ser Arg Cys Pro Ser Gly Glu Val Gin Val Ser > . Cys Thr 
130 135 

TGG GAT GAT ATC CAG TGT GTT CAA GAA TTT GGT GCC AAT GCC ACT GTG 
Trp Asp IZ Ue Gin Cys Val Glu Glu Phe Gly Ala Asn Ala Thr Val 
145 . 150 155 



60 
120 
180 
227 

275 

323 

371 

419 

467 

515 

563 



611 



659 



re/** 



GAA ACC 
Glu Thr 
160 

GCC CCA 
Ala Pro 



CCA GCT 
Pro Ala 



GCT GCT 
Ala Ale. 



GCT 
Ala 



TAC 
Tyr 
240 

ATT 
lie 



GAA 
Glu 
225 

CTC 
Leu 



GTG 
Val 
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CCA GCT GCT GAA GAT ACA ATG AAC ACC AGC CCG GGG ACT CCT 

Pro Ala Ala Glu Clu Thr Met Asn Thr Ser Pro Gly Thr Pro 

165 170 175 

GCT GCT GAA GAG ACA ATG AAC ACC AGC CCA GGG ACT CCT GCC 
Ala Ala Glu 'ilu Thr Met Asn Thr Ser Pro Gly Thr Pro Ala 
180 185 1« 

GCT GAA GAC ACA ATG ACC ACC AGC CCG GGG ACT CCT GCC CCA 
Ala Glu Glu Met Thr Thr Ser Pro Gly Thr Pro Ala Pro 

195 200 205 

GAA GAG ACA ATG ACC ACC AGC CCG GGG ACT CCT GCC CCA GCT 
Glu Glu Thr Met Thr Thr Ser Pro Gly Thr Pro Ala Pro Ala 
210 215 220 

GAG ACA ATG ACC ACC AGC CCG 3GG ACT CCT GCC TCT TCT CAT 
Glu Thr Met Thr Thr Ser Pro Gly Thr Pro Ala Ser Ser His 
230 235 



TCA TGC 
Ser Cys 



TTT GTT 
Phe Val 



ACC ATC GTA GGG ATC ATA GTT CTA ATT GTG CTT CTG 
Thr He Val Gly He He Val Leu He Val Leu Leu 
245 250 255 

TGAAAGACTT CACTGTGGAA GAAATTCCTT C CTT AC CTG A 



AAGGTTCAGG 
CTCTGCTGTG 
CCTGGCI CTA 
CCCTGGTCTC 
AAGAGGCAGG 
TCAACTGGGT 
GAGCAGAACA 



TAGGCGCTGG CTGAGGGCGG GGGGCGCTGG ACACTCTCTG CCCTOCCTCC 
TTCCCACAGA C AG AAACGC C TGCCCCTOCC CCAAGTCCTO GTGTCTCCAG 
TCTTCCTCCT TGTGATCGTC CCATCCCCAC ATCCCGTGCA CCCCCCAGGA 
ATCAGTCCCT CTCCTGGAGC TGGGGGTCCA CACATCTCCC AGCCAAGTCC 
GCCAGTTCCT CCCATCTTCA GGCCCAGCCA GGCAGGGGGC AGTCGCCTCC 

GACAAGGGTG AGGATGAGAA GTGGTCACGG GATTTATTCA GCCTTGGTCA 

CAGAGATTTT CCGTGAAAAA AAA 



707 

755 

803 

851 

899 

947 

s99 

1059 
1119 
1179 
1239 
1299 
1359 
1392 



* 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 2: 



^ 84 

Met Al- Arg He Pro Lye Thr Leu Lys Phe Val Val val He Val Al. 
x 5 10 ls 

val Leu Leu Pro Val Leu Ala Tyr Ser Al- Thr Thr Al. Arg Gin Glu 
20 25 

Glu Val Pro Gin Gin Thr Val Ala Pro Gin Gin Gin Arg His Ser Phe 
35 40 45 

Lvs Gly Glu Glu Cys Pro Ala Gly Ser His Arg Ser Glu His Thr. Gly 
50 55 «0 

Ala Cys Asn Pro Cys Thr Glu Gly Val Asp Tyr Thr Asn Ala Ser Asn 

65 7 ° 75 

Asn Glu Pro Ser Cys Phe Pro Cys Thr Val Cys Lys Ser Asp Gin Lys 
8S 9° 95 



His Lys ser Ser Cys Thr Met Thr Arg Asp Thr Val Cys Gin Cys Lys 
100 105 ."0 

Glu Gly Thr Phe Arg Asn Glu Asn Ser Pro Glu Met Cys Arg Lys Cys 



115 12° 125 

Ser Arg Cys Pro Ser Gly Glu Val Gin Val Ser Asn Cys Thr Ser Trp 

Asp Asp lie Gin Cys Val Glu Glu Phe Gly Ala A*n Ala Thr Val Glu 

Thr Pro Ala Ala Glu Glu Thr Met Asn Thr Ser Pro Gly Thr Pro Ala 
ice; 170 I 75 



165 

Thr Met Asn Thr Ser Pro Gly Thr 

185 I 90 



Pro Ala Ala Glu Glu Thr Met Asn Thr Ser Pro Gly Thr Pro Ala Pro 



Ala Al 

19S 



180 

a Glu Glu Thr Met Thr Thr Ser Pro Gly Thr Pro Ala Pro Ala 



200 205 



Ala Glu Glu Thr Met Thr Thr Ser Pro Gly Thr Pro Ala Pro Ala Ala 
210 215 220 

Glu Glu Thr Met Thr Thr Ser Pro Gly Thr Pro Ala Ser Ser Hi. Tyr 
225 230 235 

Leu ser Cys Thr He Val Gly He lie val Leu lie Val Leu Leu lie 
245 250 2" 

Val Phe Val 



(2) INFORMATION FOR SEO ID NO : 3 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10*77 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 25.. 924 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GCTCTCCCTG CTCCAGCAAG GACC ATG AGG GCG CTG GAG GGG CCA GGC CTG 

Met Arg Ala Leu Glu Gly Pro Gly Leu 
1 S 

TCG CTG CTG TGC CTG GTG TTG GCG CTG CCT GCC CTG CTG CCG GTG CCG 
Ser Leu Leu Cys Leu Val Leu Ala Leu Pro Ala Leu Leu Pro Val Pro 
10 15 20 " 

GCT GTA CGC GGA GTG GCA GAA ACA CCC ACC TAC CCC TGG CGG GAC GCA 
Ala Val Arg Gly val Ala Glu Thr Pro Thr Tyr Pro Trp Arg Asp Ala 
30 35 40 

GAG ACA GGG GAG CGG CTG GTG TGC GCC CAG TGC CCC CCA GGC ACC TTT 
Glu Thr Gly Glu Arg Leu Val Cys Ala Gin Cys Pro Pro Gly Thr Phe 
45 SO 55 

GTG CAG CGG CCG TGC CGC CGA GAC AGC CCC ACG ACG TGT GGC CCG TGT 
val Gin Arg Pro Cys Arg Arg Asp Ser Pro Thr Thr Cys Gly Pro Cys 
60 6S ™ 

CCA CCG CGC CAC TAC ACG CAG TTC TGG AAC TAC CTG GAG CGC TGC CGC 
Pro Pro Arg His Tyr Thr Gin Phe Trp Asn Tyr Leu Glu Arg Cys Arg 
75 80 85 

TAC TGC AAC GTC CTC TGC GGG GAG CGT GAG GAG GAG GCA CGG GCT TGC 
Tyr Cys Asn Val Leu Cys Gly Glu Arg Glu Glu Glu Ala Arg Ala Cys 
90 95 100 105 

CAC GCC ACC CAC AAC CGT GCC TGC CGC TGC CGC ACC GGC TTC TTC GCG 
His Ala Thr His Asn Arg Ala Cys Arg Cys Arg Thr Gly Phe Phe Ala 
110 H5 - 20 

CAC GCT GGT TTC TGC TTG GAG CAC GCA TCG TGT CCA CCT CGT GCC GGC 
His Ala Gly Phe Cys Leu Glu His Ala Ser Cys Pro Pro Gly Ala Gly 
12 5 130 1 3S 

GTG ATT GCC CCG GGC ACC CCC AGC CAG AAC ACG CAG TGC CAG CCG TGC 
val lie Ala Pro Gly Thr Pro Ser Gin Asn Thr Gin Cys Gin Pro Cys 
140 "5 150 



51 



99 



147 



195 



243 



291 



339 



387 



435 



483 



"5* 



r 





^ 86 

CCC CCA GGC ACC TTC TCA GCC AGC AGC TCC AGC TCA GAG CAG TGC CAG 

Pro Pro GXy Thr Phe Ser Ala Ser Ser Ser Ser Ser Glu Gin Cys Gin 

1S5 160 1«5 

CCC CAC CGC AAC TGC ACG GCC CTG GGC CTG GCC CTC AAT GTG CCA GGC 
Pro His Arg Asn Cys Thr Ala Leu Gly Leu Ala Leu Asn Val Pro Gly 
170 175 180 "5 

TCT TCC TCC CAT GAC ACC CTG TGC ACC AGC TGC ACT GGC TTC CCC CTC 
IZ sir Ser His Asp Thr Leu Cys Thr Ser Cys Thr Gly Phe Pro Leu 
190 195 200 

AGC ACC AGG GTA CCA GGA GCT GAG GAG TGT GAG CGT GCC GTC ATC GAC 
Ser Tnr Arg Val Pro Gly Ala Glu Glu Cys Glu Arg Ala Val He Asp 
205 210 215 

TTT GTG GCT TTC CAG GAC ATC TCC ATC AAG AGG CTG CAG CGG CTG CTG 
Phe val Ala Phe Gin Asp He Ser lie Lys Arg Leu Gin Arg Leu Leu 
220 225 230 

CAG GCC CTC GAG GCC CCG GAG GGC TGG GGT CCG ACA CCA AGG GCG GGC 

Gin 111 Leu Glu Ala Pro Glu Gly Trp Gly Pro Thr Pro Arg Ala Gly 

235 240 245 

CGC GCG GCC TTG CAG CTG AAG CTG CGT CGG CGG CTC ACG GAG CTC CTG 
Arg S! Ala Leu Gin Leu Lys Leu Arg Arg Arg Leu Thr Glu Leu Leu 
250 255 260 

GGG GCG CAG GAC GGG GCG CTG CTG GTG CGG CTG CTG CAG GCG CTG CGC 
Gly Ala Gin Asp Gly Ala Leu Leu Val Arg Leu Leu Gin Ala Leu Arg 

270 275 

GTG GCC AGG ATG CCC GGG CTG GAG CGG AGC GTC CGT GAG CGC TTC CTC 
Va! Ala Arg Met Pro Gly Leu Glu Arg Ser Val Arg Glu Arg Phe Leu 
285 290 295 

CCT GTG CAC TGATCCTGGC CCCCTCTTAT TTATTCTACA TCCTTGG CAC 
Pro Val His 
300 

CCCACTTGCA CTGAAAGAGG CTTTTTTTTA AATAGAAGAA ATGAGGTTTC TTAAAGCTTA 
TTTTTATAAA GCTTTTTCAT AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



531 



579 

627 

675 

723 

771 

819 

867 

915 

964 

1024 
1077 



r . ' 
r" -"V 



.1^ 



1 r . 

• - 



m 



0 






87 



Met Arg Ala Leu Glu Gly Pro Gly Leu Ser Leu Leu Cys Leu Val Leu 

Ala Leu Pro Ala Leu Leu Pro Val Pro Ala Val Arg Gly val Ala Glu 
20 25 30 

Ala Glu Thr Gly Glu Arg Leu Val 



Thr Pro Thr Tyr Pro Trp Arg Asp 



35 



40 



45 



CVS - Ala Gin Cys Pro Pro Gly Thr Phe Val Gin Arg Pro Cys Arg Arg 
50 55 60 

Asp Ser Pro Thr Thr Cys Gly Pro Cys Pro Pro Arg His Tyr Thr Gin 

70 75 



65 



Phe Trp Aan Tyr Leu Glu Arg Cys Arg Tyr Cys Asn Val Leu Cys Gly 



85 90 95 



Glu Arg Glu Glu Glu Ala Arg Ala Cys His Ala Thr His Asn Arg Ala 

a.00 105 
Cys Arg Cys Arg Thr Gly Phe Phe Ala His Ala Gly Phe Cys Leu Glu 



115 



His Ala Ser Cys Pro Pro Gly Ala Gly Val lie Ala Pro Gly Thr Pro 

135 140 



130 



Pro Pro Gly Thr Phe Ser Ala 



Ser Gin Asn Thr Gin Cys Gin Pro Cys 
145 150 155 

Ser Ser Ser Ser Ser Glu Gin Cys Gin Pro His Arg Asn Cys Thr Ala 
165 170 175 

Leu Gly Leu Ala Leu Asn Val Pro Gly Ser Ser Ser His Asp Thr Leu 



180 



185 



Cys Thr Ser Cys Thr Gly Phe Pro Leu Ser Thr Arg Val Pro Gly Ala 
195 200 205 

Glu Glu Cys Glu Arg Ala Val lie Asp Phe Val Ala Phe Gin Asp lie 

215 220 



210 



Ser 



He Lys Arg Leu Gin Arg Leu Leu 



Gin Ala Leu Glu Ala Pro Glu 



235 



240 



225 230 

Gly Trp Gly Pro Thr Pro Arg Ala Gly Arg Ala Ala Leu Gin Leu Lys 
245 250 255 

Leu Arg Arg Arg Leu Thr Glu Leu Leu Gly Ala Gin Asp Gly Ala Leu 



260 



265 



Leu Val Arg Leu Leu Gin Ala Leu Arg Val Ala Arg Met Pro Gly Leu 
275 280 285 



> "-TV 



2* 






88 

Glu Arg Ser Val Arg Glu Arg Phe Leu Pro Val His 
290 295 300 

(2) INFORMATION FOR SEQ ID NO:S: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1667 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iX> FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 73.. 582 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : S : 
TGGCATGTCG GTCAGGCACA GCAGGGTCCT GTGTCCGCGC TGAGCCGCGC TCTCCCTGCT 

CCAGCAAGGA CC ATG AGO GCG CTG GAG GGG CCA GGC CTG TCG CTG CTG 
C M et Arg Ala Leu Glu Gly Pro Gly Leu Ser Leu Leu 

! 5 10 

TGC CTG GTG TTG GCG CTG CCT GCC CTG CTG CCG GTG CCG GCT GTA CGC 
Cys Leu Val Leu Ala Leu Pro Ala Leu Leu Pro Val Pro Ala Val Arg 
15 20 25 

GGA GTG GCA GAA ACA CCC ACC TAC CCC TGG CGG GAC GCA GAG ACA GGG 
Gly" Val Ala Glu Thr Pro Thr Tyr Pro Trp Arg Asp Ala Glu Thr Gly 
30 35 « 

GAG CGG CTG GTG TGC GCC CAG TGC CCC CCA GGC ACC TTT GTG CAG CGG 
Glu Arg Leu Val Cys Ala Gin Cys Pro Pro Gly Thr Phe Val Gin Arg 
45 50 55 

CCG TGC CGC CGA GAC AGC CCC ACG ACG TGT GGC CCG TGT CCA CCG CGC 
Pro Cys Arg Arg Asp Ser Pro Thr Thr Cys Gly Pro Cys Pro Pro Arg 
65 70 b 

CAC TAC ACG CAG TTC TGG AAC TAC CTG GAG CGC TGC CGC TAC TGC AAC 
His Tyr Thr Gin Phe Trp Asn Tyr Leu Glu Arg Cys Arg Tyr Cys Asn 
80 65 90 

GTC CTC TGC GGG GAG CGT GAG GAG GAG GCA CGG GCT TGC CAC GCC ACC 
Val Leu Cys Gly Gi«.. Arg Glu Glu Glu Ala Arg Ala Cys His Ala Thr 
95 100 105 

CAC AAC CGT GCC TGC CGC TGC CGC ACC GGC TTC TTC GCG CAC GCT GOT 

£. Asn Arg Ala Cys Arg Cys Arg Thr Gly Phe Phe Ala His Ala Gly 

110 1" . 120 



60 
108 

156 

204 

252 

300 

348 

396 

444 






W rar CAC GCA TCG TGT CCA CCT GGT GCC GGC GTG ATT GCC 
£e Z ™ SS SE S. Ser Cys Pro Pro Gly Ala Oly Val 11. Jl. 

125 1JU 

^ w nrr TGG GCG AGG GGA GGG GCC CCC AGG AGT GGT GGC CGG 
III Sy tit ser S S A r 9 Gly Oly Al. Pro Ar g Ser Gly Gly Ar 9 
145 150 

TGT ggc AGG GGT CAG GTT GCT GGT CCC AGC CTT GCA CCC 
IS SI Sy U Gly Gin Val Ala Gly Pro Ser Leu Ala Pro 

160 168 
TGAGCTAGGA CACCAGTTCC CCTGACCCTG TTCTTCCCTC CTGGCTGCAG GCACCCCCAG 
CCAGAACACG CAGTGCCAGC CGTGCCCCCC AGGCACCTTC TCAGCCAGCA GCTCCAGCTC 
AGAGCAGTGC CAC.CCCACC GCAACTGCAC GGCCCTGGGC CTGGCCCTCA ATGTGCCAGG 
CTCTTCCTCC CATGACACCC TGTGCACCAG CTGCACTGGC TTCCCCCTCA GCACCAGGGT 
ACCAGGTGAG CCAGAGGCCT GAGGGGGCAG CACACTGCAG GCCAGGCCCA CTTGTGCCCT 
CACTCCTGCC CCTGCACGTG CATCTAGCCT GAGGCATGCC AGCTGGCTCT GGGAAGGGGC 
CACAGTGGAT TTGAGGGGTC AGGGGTCCCT CCACTAGATC CCCACCAAGT CTGCCCTCTC 
AGGGGTGGCT GAGAATTTGG ATCTGAGCCA GGGCACAGCC TCCCCTGGAG AGCTCTGGGA 
AAGTGGGCAG CAATCTCCTA ACTGCCCGAG GGGAAGGTGG CTGGCTCCTC TGACACGGGG 
AAACCGAGGC CTGATGGTAA CTCTCCTAAC TGCCTGAGAG GAAGGTGGCT GCCTCCTCTG 
ACATGGGGAA ACCGAGGCCC AATGTTAACC ACTGTTGAGA AGTCACAGGG GGAAGTGACC 
CCCTTAACAT CAAGTCAGGT CCGGTCCATC TGCAGGTCCC AACTCGCCCC TTCCGATGGC 
CCAGGAGCCC CAAGCCCTTG CCTGGGCCCC CTTGCCTCTT GCAGCCAAGG TCCGAGTGGC 
CGCTCCTGCC CCCTAGGCCT TTGCTCCAGC TCTCTGACCG AAGGCTCCTG CCCCTTCTCC 
AGTCCCCATC GTTGCACTGC CCTCTCCAGC ACGG CTCACT GCACAGGGAT TTCTCTCTCC 
TGCAAACCCC CCGAGTGGGG CCCAGaW AGGGTACCTG GCAGCCCCCG CCAGTGTGTG 
TGGGTGAAAT GATCGGACCG CTGCCTCCCC ACCCCACTGC AGGAGCTGAG GAGTGTGAGC 
GTGCCGTCAT CGACTTTGTG GCTTTCCAGG ACATCTCCAT CAAGAGGAGC GGCTGCTGCA 
GGCCC 



(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 amino acids 



540 

582 

642 
702 
762 
822 
882 
942 
1002 
1062 
1122 
1182 
1242 
1302 
1362 
1422 
1482 
1542 
1602 
1662 
1667 



PI 



Or, 








^ 90 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Met Arg Ala Leu Glu Gly Pro Gly Leu Ser Leu Leu Cys Leu Val Leu 
1 S 10 15 

Ala Leu Pro Ala Leu Leu Pro Val Pro Ala Val Arg Gly Val Ala Glu 
20 25 30 

Thr Pro Thr Tyr Pro Trp Arg Asp Ala Glu Thr Gly Glu Arg Leu Val 
35 40 45 

Cys Ala Gin Cys Pro Pro Gly Thr Phe Val Gin Arg Pro Cys Arg Arg 
50 55 60 

Asp Ser Pro Thr Thr Cys Gly Pro Cys Pro Pro Arg His Tyr Thr Gin 
65 70 75 80 

Phe Trp Asn Tyr Leu Glu Arg Cys Arg Tyr Cys Asn Val Leu Cys Gly 
65 90 95 

Glu Arg Glu Glu Glu Ala Arg Ala Cys His Ala Thr His Asn Arg Ala 
100 105 110 

Cys Arg Cys Arg Thr Gly Phe Phe Ala His Ala Gly Phe Cys Leu Glu 
115 120 125 

His Ala Ser Cys Pro Pro Gly Ala Gly Val lie Ala Pro Gly Glu Ser 
130 135 140 

Trp Ala Arg Gly Gly Ala Pro Arg Ser Gly Gly Arg Arg Cys Gly Arg 
145 150 155 160 

Gly Gin Val Ala Gly Pro Ser Leu Ala Pro 
165 170 



i; 



7^ 



(2) INFORMATION FOR SEQ ID NO : 7 : 



ti> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLuGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 







Tyr He His Pro Gin Asn Asn Ser lie Cys Cys Thr Lys Cys His LyB 
50 55 «0 

Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gin Asp Thr Asp 



65 



70 



Cys Arg Glu Cys Glu Ser Gly Ser Phe 

65 90 



75 

Thr Ala Ser Glu Asn His Leu 
95 



Arg His Cys Leu Ser Cys Ser Lys Cyr Arg Lys Glu Met Gly Gin Val 

10S 110 



100 



Glu lie Ser Ser Cys Thr Val Asp Arg Asp Thr Val Cys Gly Cys Arg 

115 "0 125 

Lys Asn Gin Tyr Arg His Tyr Trp Ser Glu Asn Leu Phe Gin Cys Phe 
130 "5 140 

Cys Leu Asn Gly Thr Val His Leu Ser Cys Gin Glu 



Asn Cys Ser Leu 
145 



150 



155 



Lys Gl 



n Asn Thr Val Cys Thr Cys His Ala Gly Phe Phe Leu Arg Glu 



165 



170 



Asn Glu Cys Val Ser Cys Ser Asn 

180 I 85 



Cys Lys Lys Ser Leu Glu Cys Thr 
190 



Lys Leu Cys Leu Pro Gin lie Glu Asn Val Lys Gly Thr Glu Asp Ser 
195 



Gly Thr Thr Val Leu Leu Pro Leu 



200 

Val He Phe Phe Gly Leu Cys Leu 



210 



215 



220 



Leu Ser Leu Leu Phe lie Gly Leu Met Tyr Arg Tyr Gin Arg Trp Lys 

225 230 235 

Ser Lys Leu Tyr Ser lie Val Cys Gly Lys Ser Thr Pro Glu Lys Glu 
245 250 *55 

Gly Glu Leu Glu Gly Thr Thr Thr Lys Pro Leu Ala Pro Asn Pro Ser 
Y -~ 265 270 



260 



Phe Ser Pro Thr Pro Gly Phe Thr Pro Thr Leu Gly Phe Ser Pro Val 



275 



280 




92 



Pro Ser Ser Thr Phe Thr Ser Ser Ser Thr Tyr Thr Pro Gly Asp Cys 
290 2»5 300 

Pro Asn Phe Ala Ala Pro Arg Arg Glu Val Ala Pro Pro Tyr Gin -Gly 
305 310 315 320 

Ala Asp Pro lie Leu Ala Thr Ala Leu Ala Ser Asp Pro He Pro 
325 330 335 



Asn 



Pro Leu Gin Lys Trp Glu Asp Ser Ala His Lys Pro Gin Ser Leu Asp 
340 345 350 

Thr Asp Asp Pro Ala Thr Leu Tyr Ala Val Val Glu Asn Val Pro Pro 

36 5 



355 



360 



Leu Arg Trp Lys Glu Phe Val Arg Arg Leu Gly Leu Ser Asp His Glu 

375 380 



370 



lie Asp Arg Leu Glu Leu 

385 390 395 



Gin Asn Gly Arg Cys Leu Arg Glu Ala Gin 

400 



Tyr Se 



r Met Leu Ala Thr Trp Arg Arg Arg Thr Pro Arg Arg Glu Ala 
405 410 415 



Thr Leu Glu Leu Leu Gly Arg Val Leu Arg Asp Met Asp Leu Leu Gly 

425 430 



420 



Cys Leu Glu Asp II 
435 



e Glu Glu Ala Leu Cys Gly Pro Ala Ala Leu Pro 



440 



445 



Pro Ala Pro Ser Leu Leu Arg 
450 4 *S 

(2) INFORMATION FOR SEQ ID NO : 8 : 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(i.) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : B : 

Met Ala Pro Val Ala Val Trp Ala Ala Leu Ala Val Gly Leu Glu Leu 
-! 5 1° 15 

Trp Ala Ala Ala His Ala Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr 

25 30 



20 



,V i 



* v 



93 



Ala Pro Glu Pro Gly Ser Thr Cye Arg Leu Arg Glu Tyr Tyr Asp Gin 
35 «° 45 



Thr 



Ala Gin Met Cys Cys Ser Lys Cys Ser Pro Gly Gin His Ala Lys 



50 



55 



60 



Val Phe Cys 
65 



Thr Lys Thr Ser Asp Thr Val Cys Asp Ser Cys Glu Asp 



70 



75 



Ser Thr Tyr Thr Gin Leu Trp Asn Trp Val Pro Glu Cys Leu Ser Cys 
85 90 



Gly Ser Arg Cys Ser Ser Asp 

100 105 



Gin Val Glu Thr Gin Ala Cys Thr * g 

110 



Glu Gin Asn Arg lie Cys Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu 



115 



120 



Ser Lys Gin Glu Gly Cys Arg Leu Cys Ala Pro Leu Arg Lys Cys Arg 
130 I" 140 

Pro Gly Phe Gly Val Ala Arg Pro Gly Thr Glu Thr Ser Asp Val Val 



145 



150 



155 



Ser Asn Thr Thr Ser Ser Thr 



L70 



175 



Cys Lys Pro Cys Ala Pro Gly Thr Phe Sc 
165 11 

Asp lie Cys Arg Pro His Gin lie Cys Asn Val Val Ala lie Pro Gly 



180 



185 



Asn Ala Ser Arg Asp Ala Val Cys Thr Ser 

200 205 



Thr Ser Pro Thr Arg Ser 



19S 



Pro Gin Pro Val Ser Thr Arg Ser 



Met Ala Pro Gly Ala Val His Leu 

210 220 
Gin His Thr Gin Pro Thr Pro Glu Pro Ser Thr Ala Pro Ser Thr Ser 



225 



230 



235 



Phe Leu Leu Pro Met Gly Pro Ser Pro Pro Ala Glu Gly Ser Thr Gly 



Asp Vhe Ala Leu Pro Val Gly Leu lie Val Gly Val Thr Ala Leu Gly 

260 265 

Leu Leu lie lie Gly Val Val Asn Cys Val He Met Thr Gin Val Lys 

a. O D 



275 



280 



In Arg Glu Ala Lys Val Pro His Leu Pro 



lys Lys Pro Leu Cys Leu Gin Arg 

290 295 300 



Ala Asp Lys Ala Arg Gly Thr 



Gin Gly Pro Glu Gin Gin His Leu Leu 



305 



310 



315 



320 



sate. 





94 



He Thr Ala Pro Ser Ser Ser Ser Se 

325 330 , 335 



r Ser Leu Glu Ser Ser Ala Ser 

330 , 335 

Ala Leu Asp Arg Arg Ala Pro Thr Arg Asn Gin Pro Gin Ala Pro Gly 

345 350 



340 



val Glu Ala Ser Gly Ala Gly Glu Ala Arg Ala Ser Thr Gly Ser Ser 
355 360 365 

Asp Ser Ser Pro Gly Gly His Gly Thr Gin Val Asn Val Thr Cys lie 
370 375 380 



Val Asn 
385 



Val Cys Ser Ser Ser Asp His Ser 



Ser Gin Cys Ser Ser Gin 



390 



395 



4 00 



Ala Ser Ser Thr Met Gly Asp Thr Asp Ser Ser Pro Ser Glu Ser Pro 
405 4,0 415 

Lvs Asp Glu Gin Val Pro Phe Ser Lys 
420 425 

Gin Leu Glu Thr Pro Glu Thr Leu Leu Gly Ser Thr Glu Glu Ly Pro 

44 5 



405 410 

Glu Glu Cys Ala Fhe Arg Ser 



430 



435 



440 



Leu Pro Leu Gly Val Pro Asp Ala Gly Met Lys Pro Ser 
450 455 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



J • 



r /'. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Gly Ala Gly Ala Thr Gly Arg Ala Met Asp Gly Pro Arg Leu Leu 
! 5 10 15 

Leu Leu Leu Leu Leu Gly Val Ser Leu Gly Gly Ala Lys Glu Ala Cys 

25 30 



20 



Pre Thr Gly 
35 



Leu Tyr Thr His Ser Gly Glu Cys Cys Lys Ala Cys Asn 
40 45 



Leu Gly 
50 



W Glu Gly val Ma Gin Pro Cys Gly Ala Asn Gin Thr Val Cys 
^ y 55 60 





95 



Phe Ser Asp Val Val Ser Ala Thr 

80 ... 



Glu Pro Cya Leu Asp Ser Val Thr 

Glu Pro Cys Lys Pro Cys Thr Glu Cys val Gly Leu Gin Ser Met Ser 

90 95 



85 



Ala Pro Cys Val Glu Ala Asp Asp 

100 105 



Ala Val Cys Arg Cys Ala Tyr Gly 
110 



Tyr Tyr Gin Asp Glu Thr Thr Gly Arg Cys Glu Ala Cys Arg Val Cys 



115 



120 



Glu Ala Gly Ser Gly L*u Val Phe Ser Cys Gin Asp Lys Gin Asn Thr 

13L 140 



130 



Val Cys Glu Glu Cys Pro Asp 

ISO 155 



Glv Thr Tyr Ser Asp Glu Ala Asn His 
7 160 



145 



Val Asp Pro Cys Leu Pro Cys Thr 

165 170 



Val Cys Glu Asp Thr Glu Arg Gin 

175 



Leu Arg Glu Cys Thr Arg Trp Ala Asp Ala Glu Cys Glu Glu lie Pro 
180 185 



Glu Gly Ser Asp Ser Thr 



Gly Arg Trp lie Thr Arg Ser Thr Pro Pro ^ 

Ala Pro Ser Thr Gin Glu Pro Glu Ala Pro Pro Glu Gin Asp Leu lie 
210 .215 2^0 

Ala Ser Thr Val Ala Gly Val Val Thr Thr Val Met Gly Ser Ser Gin 



225 



230 



235 



Pro Val Val Thr Arg Gly Thr Thr Asp Asn Leu lie Pro Val Tyr Cys 



245 



25C 



Ser lie Leu Ala Ala Val Val Val Gly Leu Val Ala Tyr lie Ala Phe 

270 



260 



Lys Arg Trp Asn Ser Cys Lys 

275 280 



265 

Gin Asn Lys Gin Gly Ala Asn Ser Arg 
285 



Pro Va 
290 



1 Asn Gin Thr Pro Pro Pro Glu Gly Glu Lys Leu His Ser Asp 
295 300 



r Gin Ser Leu His Asp Gin Gin Pro His 

3^0 



Ser Gly He Ser Val Asp Se 

305 310 315 

Thr Gin Thr Ala Ser Gly Gin Ala Leu Lys Gly Asp Gly Gly Leu Tyr 
325 330 335 

Ser Ser Leu Pro Pro Ala Lys Arg Glu Glu Val Glu Lys Leu Leu Asn 
340 345 350 



v:v>r:. 



4- 



1 



» « 




0 



97 



Val Tyr Leu Asp Asn Glu Cys Val His Cys Glu Glu 



135 



140 



Gly Met Ser Cys 
130 

Glu Arg Leu Val Leu Cys Gin Pro Gly Thr Glu Ala Glu Val Thr Asp 
145 150 155 1«° 

Glu He Met Asp Thr Asp Val Asn Cys Val Pro Cys Lys Pro Gly His 
165 17 <> 175 

Phe Gin Asn Thr Ser Ser Pro Arg Ala Arg Cys Gin Pro His Thr Arg 
180 185 190 

Cvs Glu lie Gin Gly Leu Val Glu Ala Ala Pro Gly Thr Ser Tyr Ser 
7 200 205 



195 



Pro Glu Pro Gly Ala Met Leu Leu Leu 



Asp Thr He Cys Lys Ajn Pro 

210 215 220 



Ala He Leu Leu Ser Leu Val Leu Phe Leu Leu Phe Thr Thr Val Leu 
225 230 235 

Ala Cys Ala Trp Met Arg His Pro Ser Leu Cys Arg Lys Leu Gly Thr 



245 



250 



Leu Leu Lys Arg His Pro Glu Gly Glu Glu Ser Pro Pro Cys Pro Ala 



260 



265 



Pro Arg Ala Asp Pro His Phe Pro Asp Leu Ala Glu Pro Leu Leu Pro 

280 * 85 



2*75 



Met Ser Gly Asp Leu Ser Pro Ser Pro Ala Gly Pro Pro Thr Ala Pro 
290 295 300 

Ser Leu Glu Glu Val Val Leu Gin Gin Gin Ser Pro Leu Val Gin Ala 
305 31° 315 



Arg Glu Leu Glu Ala Glu Pro Gly Glu His Gly Gin Val Ala His Gly 
325 330 335 

Ala Asn Gly He His Val Thr Gly Gly Ser val Thr Val Thr Gly Asn 
340 345 350 

lie Tyr lie Tyr Asn Gly Pro Val Leu Gly Gly Thr Arg Gly Pro Gly 
355 360 365 

Asp Pro Pro Ala Pro Pro Glu Pro Pro Tyr Pro Thr Pro Glu Glu Gly 
370 375 380 

Ala Pro Gly Pro Ser Glu Leu Ser Thr Pro Tyr Gin Glu Asp Gly Lys 
390 395 



385 



Ala Trp His Leu Ala Glu Thr 



Glu Thr Leu Gly Cys Gin Asp Leu 



405 



410 



415 



(Z) INFORMATION FOR SEQ ID NO: 11: 



' ■ 0 



V/. 




FT? 



«3 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



Met Leu Gly He Trp Thr Leu Leu Pro Leu Val Leu Thr Ser Val Ala 
5 10 15 



Arg Leu Ser Ser Lys Ser Val Asn Ala Gin Val Thr Asp He Asn Ser 
20 25 



30 



Lys Gly Leu Glu Leu Arg Lys Thr Val Thr Thr Val Glu Thr Gin Asn 
35 40 45 

Leu Glu Gly Leu His His Asp Gly Gin Phe Cys His Lys Pro Cys Pro 
50 55 60 

Pro Gly Glu Arg Lys Ala Arg Asp Cys Thr Val Asn Gly Asp Glu Pro 
65 70 75 80 

Asp Cys Val Pro Cys Gin Glu Gly Lys Glu Tyr Thr Asp Lys Ala His 
85 90 95 



ay ; 



y r - 



> « 



7 

^ ^ V 



Phe Ser Ser Lys Cys Arg Arg Cys Arg Leu Cys Asp Glu Gly "is Gly 
100 105 HO 

Leu Glu Val Glu lie Asn Cys Thr Arg Thr Gin Asn Thr Lys Cys Arg 
115 120 125 

Cys Lys Pro Asn Phe Phe Cys Asn Ser Thr Val Cys Glu His Cys Asp 

135 140 



130 



Glu His Gly He He Lys Glu Cys Thr Leu Thr 



150 



155 



160 



Pro Cys Thr Lys Cys 
145 

Ser Asn Thr Lys Cys Lys Glu Glu Gly Ser Arg Ser Asn Leu Gly Trp 

170 1 ? 5 



165 



Leu Cys Leu Leu Leu Leu Pro 

180 I 85 



He Pro Leu He Val Trp Val Lys Arg 

190 



Lys Glu Val Gin Lys Thr Cys Arg Lys His Arg Lys Glu Asn Gin Gly 
195 200 205 

Ser His Glu Ser Pro Thr Leu Asn Pro Glu Thr Val Ala He Asn Leu 
210 215 220 



*1 





i 



99 



Ser Asp Val Asp Leu Ser Lys Tyr lie Thr Thr lie Ala Gly Val Met 



225 



Thr Leu Ser Gin Val Lys Gly Phe Val Arg Lys Asn Gly Val Asn Glu 



245 



250, 



Ala Lys 



He Asp Glu He Lys Asn Asp Asn 



Val Gin Asp Thr Ala Glu 



260 265 270 



Gin Lys Val Gin Leu Leu Arg Asn Trp His Gin Leu His Gly Lys Lys 
275 280 285 

Glu Ala Tyr Asp Thr Leu lie Lys Asp Leu Lys Lys Ala Asn Leu Cys 



290 



295 



300 



Thr Leu Ala Glu Lys lie Gin Thr He He Leu Lys Asp He Thr Ser 



305 



Asp Ser 



310 



315 



Glu Asn Ser Asn Phe Arg Asn Glu He Gin Ser Leu Val 



325 



330 



335 



/V > - 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



* < 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ala Arg Pro His Pro Trp Trp Leu Cys Val Leu Gly Thr Leu Val 
c 10 lb 



Gly Leu Ser Ala Thr Pro Ala Pro Lys Ser Cys Pro Glu Arg His Tyr 
20 25 



30 



Trp Ala Gin Gly Lys Leu Cys Cys Gin Met Cys Glu Pro Gly Thr Phe 
35 40 4S 

Leu Val Lvs Asp Cys Asp Gin His Arg Lys Ala Ala Gin Cys Asp Pro 

SO ' SS 60 

Cys lie Pro Gly Val Ser Phe Ser Pro Asp His His Thr Arg Pro His 



65 



70 



Cys Glu Ser Cys Arg His Cys Asn Ser Gly Leu Leu Val Arg Asn Cys 

90 95 



85 



V 






re 

: i 



Thr lie 



Thr Ala Asn Ala Glu Cye Ala Cys Arg Aon Gly Trp Gin Cys 



100 



105 



110 



Arg Asp Lys G lu C V S 



Thr Glu Cys Asp Pro Leu Pro Asn Pro Ser Leu 



115 



120 



125 



Thr Ala Arg Ser Ser Gin Ala Leu Ser Pro His Pro Gin Pro Thr His 

135 140 



130 



Glu Ala Arg Thr Ala Gly His Met 

160 



Leu Pro Tyr Val Ser Glu Met Leu 
14S 150 155 

Gin Thr Leu Ala Asp Phe Arg Gin Leu Pro Ala Arg Thr Leu Ser Thr 

170 175 



165 



Ser Ser Asp Phe lie Arg He 



His Trp Pro Pro Gin Arg Ser Leu Cys 

185 190 



180 



Leu 



val He Phe Ser Gly Met Phe Leu Val Phe Thr Leu Ala Gly Ala 



195 



200 



205 



Leu Phe Leu His Gin Arg Arg Lys Tyr Arg Ser Asn Lys Gly Glu Ser 



210 



215 



220 



Pro val Glu Pro Ala Glu Pro Cys Arg Tyr Ser Cys Pro Arg Glu Glu 



225 



230 



Glu Gly Ser Thr He Pro lie Gin Glu Asp Tyr Arg Lys Pro Glu Pro 



245 



250 



Ala Cys Ser Pro 
260 




(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SLQ ID NO: 13: 

Met Arg Val Leu Leu Ala Ala Leu Gly Leu Leu Phe Leu Gly Ala Leu 
5 10 15 



Arg 



Ala Phe Pro Gin Asp Arg Pro Phe Glu Asp Thr Cys His Gly Asn 



20 



25 



30 



t 



VP. 

1 





Hie Tyr Tyr Asp Lys Ala Val Arg Arg Cye Cys Tyr Arg Cys 



Pro Met Gly Leu Phe Pro Thr Gin Gin Cys Pro Gin Arg Pro Thr Asp 
50 55 60 

Cys Arg Lys Gin Cys Glu Pro Asp Tyr Tyr Leu Asp Glu Ala Asp Arg 



65 



70 



75 



Cys Thr Ala Cys Val Thr Cys Ser Arg Asp Asp Leu Val Glu Lys Thr 



85 



90 



95 



Pro Cys Ala Trp Asn Ser Ser Arg Val Cys Glu Cys Arg Pro Gly Met 
100 105 110 

Phe Cys Ser Thr Ser Ala Val Asn Ser Cys Ala Arg Cys Phe Phe His 
115 120 125 

Ser Val Cys Pro Ala Gly Met lie Val Lys Phe Pro Gly Thr Ala Gin 



Ki- 



rn 




r,' o 



Lvs Asn Thr Val Cys Glu Pro Ala Ser Pro Gly Val Ser Pro Ala Cys 

155 160 



145 



150 



Ala Ser Pro Glu Asn Cys Lys Glu Pro Ser Ser Gly Thr He Pro Gin 
165 1^0 175 

Ala Lys Pro Thr Pro Val Ser Pro Ala Thr Ser Ser Ala Ser Thr Met 



1B0 



185 



190 



Pro Val Arg Gly Gly Thr Arg Leu Ala Gin Glu Ala Ala Ser Lys Leu 
19S 200 205 

Thr Arg Ala Pro Asp Ser Pro Ser Ser Val Gly Arg Fro Ser Ser Asp 
210 215 220 

Pro Gly Leu Ser Pro Thr Gin Pro Cys Pro Glu Gly Ser Gly Asp Cys 



225 



230 



235 



240 



Arg Lys Gin Cys Glu Pro Asp Tyr Tyr Leu Asp Glu Ala Gly Arg Cys 
245 250 255 

Thr Ala Cys Val Ser Cys Ser Arg Asp Asp Leu Val Glu Lys Thr Pro 
260 265 270 

Cys Ala Trp Asn Ser Ser Arg Thr Cys Glu Cys Arg Pro Gly Met He 
275 280 285 

Cys Ala Thr Ser Ala Thr Aon Ser Cys Ala Arg Cys Val Pro Tyr Pro 
290 295 300 

He Cys Ala Ala Glu Thr Val Thr Lys Pro Gin Asp Met Ala Glu Lys 
305 HO 315 320 



i 






102 

Asp Thr Thr Phe Glu Ala Pro Pro Leu Gly Thr Gin Pro Asp Cys Asn 
325 330 335 

Pro Thr Pro Glu Asn Gly Glu Ala Pro Ala Ser Thr Ser Pro Thr Gin 
340 "5 35° 

Ser Leu Leu Val Asp Ser Gin Ala Ser Lys Thr Leu Pro He Pro Thr 
355 360 3 «5 

Ser Ala Pro Val Ala Leu Ser Ser Thr Gly Lys Pro val Leu Asp Ala 
370 3 " «° 

Gly Pre val Leu Phe Trp Val lie Leu Val Leu Val Val Val val Gly 
38S «° 395 

Ser Ser Ala Phe Leu Leu Cys His Arg Arg Ala Cys Arg Lys Arg He 
405 410 415 

Arg oln Lys Leu His Leu Cys Tyr Pro Val Gin Thr Ser Gin Pro Lys 
420 «25 

Leu Glu Leu val Asp Ser Arg Pro Arg Arg Ser Ser Thr Gin Leu Arg 



435 



440 



445 



Ser Gly Ala Ser Val Thr Glu Pro Val Ala Glu Glu Arg Gly Leu Met 
450 4SS 46° 

Ser Gin Pro Leu Met Glu Thr Cys His Ser Val Gly Ala Ala Tyr Leu 
465 470 47S 480 



Glu Ser 



Leu Pro Leu Gin Asp Ala Ser Pro Ala Gly Gly Pro Ser Ser 



485 



490 



495 



Pro Arg Asp Leu Pro Glu Pro Arg Val Ser Thr Glu His Thr Asn Asn 

501 505 510 

Lys lie Glu Lys lie Tyr He Met Lys Ala Asp Thr Val lie Val Gly 

520 525 



blS 



Glu Gly Arg Gly Leu Ala Gly Pro Ala 



Thr Val Lys Ala Glu Leu Pro 

530 535 540 

Glu Pro Glu Leu Glu Glu Glu Leu Glu Ala Asp His Thr Pro His Tyr 
545 550 555 560 

Pro Glu Gin Glu Thr Glu Pro Pro Leu Gly Ser Cys Ser Asp Val Met 
565 570 575 

Leu Ser Val Glu Glu Glu Gly Lys Glu Asp Pro Leu Pro Thr Ala Ala 
S80 585 590 



55 



i 



j 




Ser Gly Lys 
595 

(2) INFORMATION FOR SEQ ID NO: 14 : 




a 1 P ■ r; ~ifrir '^ a = >r '- ft**** run 



0 



103 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 277 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Val Arg Leu Pro Leu Gin Cys Val Leu Trp Gly Cys Leu Leu Thr 
5 10 15 

Ala Val His Pro Glu Pro Pro Thr Ala Cys Arg Glu Lys Gin Tyr Leu 
20 25 30 

lie Asn Ser Gin Cys Cys Ser Leu Cys Gin Pro Gly Gin Lys Leu Val 



35 



40 



45 



Ser Asp Cys Thr Glu Phe Thr Glu Thr Glu Cys Leu Pro Cys Gly Glu 
50 S5 6* 

Ser Glu Phe Leu Asp Thr Trp Asn Arg Glu Thr His Cys His Gin His 
65 ^O 75 80 

Lys Tyr cys Asp Pro Asn Leu Gly Leu Arg Val Gin Gin Lys Gly Thr 
85 9° 95 

Ser Glu Thr Asp Thr lie Cys Thr Cys Glu Glu Gly Trp His Cys Thr 
100 . 105 H0 

Ser Glu Ala Cys Glu Ser Cys Val Leu His Arg Ser Cys Ser Pro Gly 
!X5 120 12S 

Gly Val Lys Gin He Ala Thr Gly Val Ser Asp Thr lie Cys Glu 



130 



135 



140 



Pro Cys Pro Val Gly Phe Phe Ser Asn Val Ser Ser Ala Phe Glu Lys 



145 



150 



155 



160 



Cys His Pro Trp Thr Ser Cys Glu Thr Lys Asp Leu Val Val Gin Gin 
165 I 70 175 

Ala Gly Thr Asn Lys Thr Asp Val Val Cys Gly Pro Gin Asp Arg Leu 
1B0 185 190 

Arg Ala Leu Val Val He Pro He He Phe Gly He Leu Phe Ala He 
195 200 205 

Leu Leu Val Leu Val Phe He Lys Lys Val Ala Lys Lys Pro Thr Asn 
210 215 220 



• 




1* 

i 



. - ii 




f 4 

m 

Jit ■■<>/. 



r 





1 04 



Lvs Ala Pro Hie Pro Lye Gin Glu Pro Gin Glu lie Asn Phe Pro Asp 

- 230 240 



225 



235 



Asp Leu Pro Gly Ser Asn Thr Ala Ala Pro Val Gin Glu Thr Leu His 
245 255 



Gly Cys Gin Pro 
260 



250 255 
Val Thr Gin Glu Asp Gly Lys Glu Ser Arg He Ser 



265 



270 



Val Gin Glu Arg Gin 
275 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS ; single 
(D) TOPOLOGY: linear 

<ii) MOLECULE TVPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



Met Gly Asn Ser Cys Tyr Asn 

5 10 



lie Val Ala Thr Leu Leu Leu Val Leu 

15 



As:. Phe Glu Arg Thr Arg Ser Leu Gin Asp Pro Cys Ser Asn Cys Pro 



20 



25 



30 



Ala Gly Thr Phe Cys Asp Asn Asn Arg Asn Gin He Cys Ser Pro Cys 

40 45 



35 



Pro Pro Asn Ser Phe Se 

50 55 



r Ser Ala Gly Gly Gin Arg Thr Cys Asp He 

60 



1 Phe Arg Thr Arg Lys Glu Cys Ser Ser 

80 



Cys Arg Gin Cys Lys Gly Va 

65 7 ° 75 

Thr Ser Asn Ala Glu Cys Asp Cys Thr Pro Gly Phe His Cys '-eu Gly 
85 90 95 

Ala Gly Cys Ser Met Cys Glu Gin Asp Cys Lys Gin Gly Gin Glu Leu 
100 105 HO 

Thr Lys Lys Gly Cys Lys Asp Cys Cys Phe Gly Thr Phe Asn Asp Gin 



115 



120 



Lys Arg Gly He Cys Arg Pro Trp Thr Asn Cys Ser Leu Asp Gly Lys 
130 135 1<° 



i 



Gin Leu Cys Thr Ala Thr Gin Asp Thr Val Cys Arg Cys Arg Al* Gly 
100 105 

Thr Gin Pro Leu Asp Ser Tyr Lye Pro Gly Val Asp Cys Ala Pro Cys 



115 



120 



125 



Pro Pro Gly His Phe Ser Pro Gly Asp Asn Gin Ala Cys Lys Pro Trp 

135 140 



130 



Thr Asn Cys Thr Leu Ala Gly Lys His Thr Leu Gin Pro Ala Ser Asn 

- ~ — 160 



145 



150 



155 



Ser 



Ser Asp Ala He Cys Glu Asp Arg Asp Pro Pro Ala Thr Gin Pro 



165 



170 



Gin Glu Thr Gin Gly Pro Pro Ala Arg Pro He Thr Val Gin Pro Thr 

185 190 



160 



Glu 



Ala Trp Pro Arg Thr Ser Gin Gly Pro Ser Thr Arg Pro Val Glu 



195 



200 



205 



Val Pro Gly Gly Arg Ala Val Ala Ala He Leu Gly Leu Gly Leu Val 

215 220 



210 



Leu Gly Leu Leu Gly Pro Leu 

225 230 235 



Ala He Leu Leu Ala Leu Tyr Leu Leu 

240 



Arg 



Arg Asp Gin Arg Leu Pro Pro Asp Ala His Lys Pro Pro Gly Gly 



245 



250 



Gly Ser Phe Arg Thr Pro I 



le Gin Glu Glu Gin Ala Ai-p Ala His Ser 



260 



265 



270 



Thr Leu Ala Lys He 
275 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
CD) TOPOLCCiY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 17: 



Met Lys Sar Val Leu 



Tyr Leu Tyr He Leu Phe Leu Ser Cys He He 



3 0 



15 




0fr 

A A 



_ - i 



'OX 



* • 
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He Asn Gly Arg Asp Ala Ala Pro Tyr Thr Pro Pro Asn Gly Lys Cys 

20 - 25 - — 3.Q.- 



Lys Asp Thr 



Glu Tyr Lys Arg His Asn Leu Cys Cys Leu Ser Cys Pro 



45 



35 40 
Pro Gly Thr Tyr Ala Ser Arg Leu Cys Asp Ser Lys Thr Asn Thr Gin 



50 



55 



60 



Cvs Thr Pro Cys Gly Ser Gly Thr Phe Thr Ser Arg Asn Asn His Leu 
65 ^O 75 r.0 

Pro Ala Cys Leu Ser Cys Asn Gly Arg Cys Asn Ser Asn Gin Val Glu 
85 90 95 

Thr Arg Ser Cys Asn Thr Thr His Asn Arg He Cys Glu Cys Ser Pro 
100 105 110 

Gly Tyr ryr Cys Leu Leu Lys Gly Ser Ser Gly Cys Lys Ala Cys Val 

120 125 



115 



Se 



r Gin Thr Lys Cys Gly He Gly Tyr Gly Val Ser Gly His Thr Ser 



130 



135 



140 



Val Gly Asp Val He Cys Ser Pro Cys Gly Phe Gly Thr Tyr Ser His 
14S 150 155 "0 

Thr Val Ser Ser Ala Asp Lys Cys Glu Pro Val Pro Asn Asn Thr Phe 
165 170 175 



Asn Tyr He Asp Val 



Glu He Thr Leu Tyr Pro Val Asn Asp Thr Ser 



180 



185 



190 



C*s Thr Arg Thr Thr Thr Thr Gly Leu Ser Glu Ser He Leu Thr Ser 

195 200 205 

Glu Leu Thr He Thr Met Asn His Thr Asp Cys Asn Pro Val Phe Arg 
210 215 220 



Gl 



225 



u Glu Tyr Phe Ser Val Leu Asn Lys Val Ala Thr Ser Gly Phe Phe 



230 



235 



Thr Gly Glu Asn Arg Tyr Gin Asn He £ er Lys Val Cys Thr Leu Asn 
245 :50 255 

Phe Glu He Lys Cys Asn Asn Lys Gly Ser Ser Phe Lys Gin Leu Thr 
260 265 270 

Lys Ala J.ys Asn Asp Asp Gly Met Met Ser His Ser Glu Thr Val Thr 
275 280 



285 



Leu Ala Gly Asp Cys Leu Ser Ser Val Asp He Tyr He Leu Tyr Ser 
290 300 



f - 

- 



^ 108 

Asn Thr Asn Ala Gin Asp Tyr Glu Thr Asp Thv He Ser Tyr Arg Val 
305 310 315 320 

Glv Asn Val Leu Aso Asp Asp Ser His Met Pro Gly Ser Cys Asn lie 
325 330 335 

His Lys Pro He Thr Asn Ser Lys Pro Thr Arg Phe Leu 
340 345 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 




—4 




(xi) SEQUENCE DESCRIPTION: SEQ ID N0:18: 



Met Lys Ser Tyr He Leu Leu Le 

5 10 



u Leu Leu Ser Cys He He He He 

15 



Asn Ser Asp He Thr Pro His Glu Pro Ser ten Gly Lys Cys Lys Asp 
20 25 30 

Asn Glu Tyr Lys Arg His His Leu Cys Cys Leu Ser Cys Pro Pro Gly 



35 



40 



45 



Thr Tyr Ala Ser Arg Leu Cys As? Ser Lys Thr Asn Thr Asn Thr Gin 

55 60 



50 



Cys Thr Pro Cys Ala Ser Asp 

65 7 ° 75 



r Asp Thr Phe Thr Ser Arg Asn Asn His L?u 

0 



Pro Ala Cys Leu Ser Cys Asn Gly Arg Cys Asp Ser Asn Gin Val Glu 

90 95 



85 



j .' 



Thr Arg Ser Cys Asn Thr 



Thr His Asn Arg He Cys Asp Cys Ala Pro 



100 



10S 



110 



Gly Tyr Tyr Cys Phe Leu Lys Gly Ser Ser Gly Cys Lys Ala Cys Val 
115 120 125 

Ser Gin Thr Lys Cys Gly He Gly Tyr Gly Val Ser Gly His Thr Pro 
130 135 140 



Thr Gly Asp Val Val Cys Ser Pro Cys Gly Leu Gly Thr t r Ser His 
145 150 15S l<-° 





109 



Thr Val Ser Ser 



Val Asp Lye Cys Glu Pro Val Pro Ser Asn Thr Phe 



165 



170 



175 



Asn Tyr He Asp Val Glu He Aen Leu Tyr Pro Val Asn Asp Thr Ser 
180 18S 



Cys Thr Arg 
195 



Thr Thr Thr Thr Gly Leu Ser Glu Ser He Ser Thr Ser 



200 



205 



Glu Leu Thr Ha Thr Met Asn His Lys Asp Cys Asp Pro Val Phe Arg 
210 215 220 

Asn Gly Tyr Phe Ser Val Leu Asn Glu Val Ala Thr Ser Gly Phe Phe 
225 230 235 240 

Thr Gly Gin Asn Arg Tyr Gin Asn He Ser Lys Val Cys Thr Leu Asn 
245 250 255 

Phe Glu He Lys Cys Asn Asn Lys Asp Ser Tyr Ser Ser Ser Lys Gin 
260 265 270 

Leu Thr Lys Thr Lys Asn Asp Asp Asp Ser He Met Pro His Ser Glu 
275 280 285 

c e r Val Thr Leu Val Gly Asp Cys Leu Ser Ser Val Asp He Tyr He 
290 295 300 

Leu Tyr Ser Asn T,.r Asn Thr Gin Asp Tyr Glu Thr Asp Thr He Ser 



305 



310 



315 



Tyr His Val Gly Asn Val Leu Asp Val Asp Ser His Met Pro Gly Arg 
325 330 335 



Cys Asp Thr His Lys Leu He Thr Asn Ser Asn Ser Gin Tyr Pro Thr 
340 345 350 

His Phe Leu 
355 

(2) INFORMATION FOR SEQ ID NO; 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 506 base oairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

<ii) MOLECULE TYPE: DNA (genomic) 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GAATTCGGCA NAGCCTCTCC ACGCGCAGAA CTCAGCCAAC GATTTCTGAT AGATTTTTGG 



60 



5* 




A 4 




.4 ;, : >'^ 




ACCGTTAGGA 
CCCAGGACCC 
TTCGTGGTCG 
CGGCAGAGGA 
GGGNAGGAGT 
CACAAGGGTT 
TT 1 1 AATTAG 



110 

GAGTTTGACC AGAGATGCAA GGGGTGAAGG AGCGCTTCCT 
CAGNNCGCCC CGGCCGCCTG ATGGCCGAGG CAGGGTGCGA 
CGGGAACCAT ACCATGGCCC GGATCCCCAA GACCCTAAAG 
GGTCCTGCTG CCAGTCCTAG CTTACTCTGC CACCACTGCC 
CAGNCANTGG NCCCACAGCA ACAGNGGCAC AGTTTCAAGG 
GTTTTTATAG TTCAGAACNT ATTGGNG CTN TNAACCCTTG 
CAANGTTTCC AANATGNACT TTTTNGTTCC CTGTTANATT 
ATTTNTNAAC CTTNCCNGGG NAAATT 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ IT> NO: 20: 
GGCAGAGGTG TCTCCAGCCT GGCTCTATCT TCCTCCTTGT NATCGTCCCA 
CCGTGCACL ; CCCAGGACCC TGGTCTCATC AGTCCCTCTC CTGGAGCTGG 
ATCTCCCAGC CAAGTCCAAG AGGGCAGGGC CAGTTCCTCC CATCTTCAGG 
CAGGGGGCAG TCGGCTCCTC AACTGGGTGA CAAGGGTGAG GATGAGAAGT 
ATTTATTCAG CCTTGGTCAG AG C AG AACAC AGATTTTTCC GTGTGTTGGT 
NTTCCCCTTC TTNATNCCCC TTTCN 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



actctgggga 
aggacggcgt 
tcatcgtcgc 
agttncccag 
tttcca:jcaa 
tggnttaaac 
ttnaanttaa 



TCCCCACATC 
GGGTCCACAC 
CCCAGCCAGG 
GGTCACGGGG 
TTTTACTCTN 



(ii) MOLECULE TYPE: DNA (genomic) 



120 
180 
240 
300 
360 
420 
480 
506 



60 
120 
180 
240 
300 
325 



l 




J* 



£-4 



i ox; 



CCGGGGACTC CTGNNTCTNC 
ATTGTGCCTT CTAATTGTTT 
ACCTGTAAGT TNCAGGTAGG 
CCTGCCTCCC TCTGNCTGTT 
TNGTGTTTTC CAGCCTGGCT 



(XX) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
GGCAGAGGCC C C AG CTGCTG AAGAGACAAT AATCACCAGC 
TNATTACCTC TNATGCACCA TCGTAGGGAT CATAGTTCTA 
TTGTTTGAAA AGANTTCACT GTGGAAGAAA TTCCTTCCTT 
NGCCTGGCTG AGGGCGGGGG GCGCTGGTAC ACTCTCTGAC 
TTC CC AC AG A CAGAAACGCC TGCCCCTGNC CCCAAGTTCC 
CTATCTTNNC TCCTTGTGAA TCGTTCCCAT CCCCACANGC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : single 
(D) TOPOIiOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CCAGGGTCTC CTNCCCCACC TGCTGAAGAG ACANTGACCA CCAGCCCGGG GACTCCTGCC 
TCTTCCTCAT TACCICTNAT GNANCATCGT AGGGATCATA GTTCTAATTG TG C CTTCTG A 
ATTGTG CTTT GTTT'J 3 AAAG ACTTCACTGT GGGAAGAAAT TCCTTCCTTA CCTGAAGTTG 
CAGGTAGGCC CTGGGTNAGG GCGNGGGGCG CTGGACANTN TCTGGNCCTG GCTGCCCGCT 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GGCACGAGCA GGGTCCTGTN TCCGCCCTOA GCCGCGCTCT NCCTGCTCCA GCAAGGACCA 



60 
120 
180 
240 
300 
340 



60 
120 
180 
240 
241 



60 








GTGTTGGCGC 
ACNTACCCCT 
GCACCTTTNT 
CGCGCCACTA 
CGGGGAGCGT 
GCCGNACCGG 
ATTN ATT J AC 



TGCCTGCCCT 
GGCGGGACGN 
GCAGCGGCCG 
CACGCATTCT 
NAGGAGGAGG 
TTT CTTCGNG 
GNAGTGATTN 



TGAGGGCGCT GGAGGGGCCA GGCCTGTCGC TGCTGTCCTG 
GCTGCCGGTG CCGGCTGTAC GCGGAGTGGC AGAAACACNN 
AGAGACAGGG GAG CGG CTGG TGTNTNCCCA NTGCCCCCA3 
TGCCGNCGAG ACAGCCCCAC GACGTGTGGC CCGTNTCCAC 
GGAACTACCT GGAGCGCTGN CCTTACTNCA ACGTCCTCTG 
CACGGGTTTN CCACGNCAAC CACAACCCNG GNTTACCGTN 
GCAAGTTGGT TTTTNNTTTG GAGNAAGGAT TCGTGTTNCA 
. NNCNCGGGAA ACTNAAA 

(2) INFORMATION FOR SEQ ID NO: 24; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CGCAACTGCA CGGCCCTGGG ACTGGCCCTC AATGTGCCAG GNTCTTCCTC CCATGACACC 
CTGTG CACCA GCTGCACTGG CTTCCCCCTC AGACCAGGGT ACCANGAGCT GAGGAGTGTG 
AGCNTGCCGT CATCGACTTT TTGGCTTTCC AGGACATCTC CATCAAGAGG CTGCAGCGGC 
TGCTCANGCC 

(2) INFORMATION FOR SEQ ID NO:2S: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
IB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CGCGGATCCA CCACTGCCCG GCAGGAG 



120 
180 
240 
300 
360 
420 
480 
497 



60 
120 
180 
190 



*\ * 
1 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
GCGTCTAGAC TAGTAATGAG A/.GAGGCAGG 
(2) INFORMATION FOR SEQ ID NO:27: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



i * 



*> * - ■ ■ 



30 



l ■ 



>7*» V 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 27; 

CGCCCATGGC AGAAACACCC ACCTAC 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 
•.A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



26 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

CGCAAGCTTC TCTTTCAGTG CAAGTG 

(2) INFORMATION FOR SEQ ID NO: 29. 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2B base pairs 




114 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CGCAAGCTTC TCCTCAGCTC CTGCAGTG 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



F" 3 
y 



28 





(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

CGCTCTAGAC CGCCATCATG GCCCGGATCC CCAAG 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
(A ^ LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CGCGGATCCG CCATCATGAG GGCGTGGAGG GGCCAG 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



36 
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(ii) MOLECULE TYPE: DNA (genomic) 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 32; 
GCGTCTAGAC TAGTAATGAG AAGAGGCAGG 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 





(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 
CGCGGTACCC TCTTTCAGTG CAAGTG 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3< 
CGCGGTACCC TCCTCAGCTC CTGCAGTG 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



X 



« 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO- 35: 
CGCGAATTCC GCCATCATGG CCCGGATCCC CAAG 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: 5LQ ID NO: 36: 
GCGTCTAGAG TAATGAGAAG AGGCAGG 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
CGCTCTAGAC CGCCATCATG GCCCGGATCC CCAAG 
(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDErNESS : single 
(Di TOPOLOGY- linear 

(ii) MOLECULE TYPE: DNA (genomic) 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 B : 
JTCTAGAC TAGTAATGAG AAGAGGCAGG 
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m Is Claimed Is: 

1 An isolated nucleic acid molecule comprising a polynucleotide having a 

nucleotide sequence at least 95% identical to a sequence selected from the group consisting of: 

(a) a nucleotide sequence encoding a TNFR polypeptide having the complete amino 
acid sequence in SEQ ID NO:2. 4 or 6 or as encoded by a cDNA clone contained in ATCC 
Deposit No. 97798. 97810 or 97809; 

(b) a nucleotide sequence encoding a mature TNFR polypeptide having an amino acid 
sequence at positions 27-259. 31-300 or 31-170 in SEQ ID NO:2. 4 and 6. respectively, or as 
encoded by the cDNA clone contained in the ATCC Deposit No. 97798. 97810 or 97809; 

(c) a nucleotide sequence encoding the soluble extracellular domain of a TNFR 
polypeptide having the amino acid sequence at positions 27-240. 31-283 or 31-166 of SEQ ID 
NOS:2, 4 and 6. respectively; and 

(d) a nucleotide sequence complementary to any of the nucleotide sequences in (a). 

(b). or (c) above. 

2. The nucleic acid molecule of claim 1 wherein said polynucleotide has a 
complete nucleot.de sequence selected from the group consisting of SEQ ID NO:l. SEQ ID 
NO:3 and SEQ ID NO:5. 
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3. The nucleic acid molecule of claim I wherein said polynucleotide has a 
nuclcot.de sequence which encodes a TNFR polypeptide having a complete amino acid 
sequence selected from the group consist.ng of SEQ ID NO:2. SEQ ID NO:4 and SEQ ID 
NO:6. 

4. The nucle.c ac.d molecule of claim 1 wherein sa.d polynucleotide has a 
nucleotide sequence encoding the mature form of a TNFR polypeptide havmg an am.no acid 
sequence from about 27 to about 259 ,n SEQ ID NO:2. from about 3 1 to about 300 in SEQ 
ID NO:4. or from about ^ ! to about 170 in SEQ ID NO:6. 
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5. The nucleic acid molecule of claim 1 wherein said polynucleotide has a 
nucleotide sequence encoding the soluble extracellular domain of a TNFR polypept ide having 
the amino acid sequence from about 27 to about 240 in SEQ ID NO:2, from about 31 to about 
28? in SEQ ID NO:4, or from about 31 to about 166 of SEQ ID NO:6. 

6. An isolated nucleic acid molecule comprising a polynucleotide having a 
nucleotide sequence at least 95% identical to a sequence selected from the group consisting of: 

(a) a nucleotide sequence encoding a polypeptide comprising the amino acid 
sequence of residues m-259 of SEQ ID NO:2, where m is an integer in the range of 1-53; 

(b) a nucleotide sequence encoding a polypeptide comprising the amino acid 
sequence of residues n-300 of SEQ ID NO:4, where n is an integer in the range of 1-49; 

(c) a nucleotide sequence encoding a polypeptide comprising the amino acid 
sequence of residues n-170 of SEQ ID NO:6, where n is an integer in the range of 1-49; 

(d) a nucleotide sequence encoding a polypeptide comprising the amino acid 
sequence of residues 1-x of SEQ ID NO:2 % where x is an integer in the range of 149-259: 

(e) a nucleotide sequence encoding a polypeptide comprising the amino acid 
sequence of residues 1-y of SEQ ID NO:4, where y is an integer in the range of 193-300; 

(0 a nucleotide sequence encoding a polypeptide comprising the amino acid 
sequence of residues 1-z of SEQ ID NO:6, where z is an integer in the range of 132-170: -id 

(g) a nucleotide sequence encoding a polypeptide having the amino acid sequence 
consisting of residues m-x of SEQ ID NO:2. n-y of SEQ ID NO:4, or n-z of SEQ ID NO-'. . 
m, n. x, y and z are defined in (a), (b), (c), (d), (e) and (f) above. 

7. An isolated nucleic acid molecule comprising a polynucleotide having a 
nucleotide sequence at least 95% identical to a sequence selected from the group consisting of: 

(a) a nucleotide sequence encoding a polypeptide consisting of a portion of a 
complete TNFR amino acid sequence encoded by a cDNA clone contained in ATCC Deposit 
No. 97798, 97810 or 97809 wherein said portion excludes from I to about 52. from I to 
about 48 uiul from 1 to about 48 amino acids from the amino terminus of said complete 
amino acid sequence encoded by the cDN A clone contained in ATCC Deposit No. 977t) S . 
97KID and 97809, respectively; 
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(b) a nucleotide sequence encoding a polypeptide consisting of a portion of a 
complete TNFR amino acid sequence encoded by a cDNA clone contained in ATCC Deposit 
No. 97798, 97810 or 97809 wherein said portion excludes from 1 to about 1 10, from I to 
about 107, or from 1 to about 38 amino acids from the carboxy terminus of said complete 
amino acid sequence encoded by the cDNA clone contained in ATCC Deposit No. 97798. 
97810 and 97809. respectively; and 

(c) a nucleotide sequence encoding a polypeptide consisting of a portion of a complete 
TNFR amino acid sequence encoded by the cDN A clone contained in ATCC Deposit No. 
97798, 97810 or 97809, wherein said portion includes a combination of any of the amino 
terminal and carboxy terminal deletions for the respective clones in (a) and (b), above. 

8. The nucleic acid molecule of claim I wherein said polynucleotide has the 
complete nucleotide sequence of the cDNA clone contained in ATCC Deposit No. 97798. 
97810 or 97809. 

9 The nucleic acid molecule of claim 1 wherein said polynucleotide has the 
nucleotide sequence encoding a TNFR polypeptide having the complete amino acid sequence 
encoded by the cDNA clone contained in ATCC Deposit No. 97798, 97810 or 97809. 

10. The nucleic acid molecule of claim I wherein said polynucleotide has the 
nucleotide sequence encoding a mature TNFR polypeptide having the amino acid sequence 
encod 



3dcd by the cDNA clone contained in ATCC Deposit No. 97798, 97810 or 97809. 



11. An isolated nucleic acid molecule comprising a polynucleotide which 
hybridizes under stringent hybridization conditions to a polynucleotide having a nucleotide 
sequence identical to a nucleotide sequence in (a), (b). (c), or (d) of claim 1 w herein said 
polynucleotide which hybridizes does not hybridize under stringent hybridization conditions 
to a polynucleotide having a nucleotide sequence consisting of only A residues or of only T 
residues. 
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1 2. An isolated nucleic acid molecule comprising a polynucleotide which encodes 
the amino acid sequence of an epitope-bearing portion of a TNFR polypeptide having an 
amino acid sequence in (a), (b), (c) or(d) of claim I. 

13. The isolated nucleic acid molecule of claim 12, which encodes an 
epitope-bearing portion of a TNFR polypeptide comprising amino acid residues selected from 
the group consisting of: from about Gln-42 to about Glu-52 in SEQ ID NO:2, from about 
His-58 to about Cys-66 in SEQ ID NO:2, from about Pro-68 to about Thr-76 in SEQ ID 
NO:2, from about Ser-79 to about Cys-85 in SEQ ID NO:2, from about Cys-9I to about Thr- 
102 in SEQ ID NO:2, from about Ghvl 10 to about Pro- 122 in SEQ ID NO:2, from about 
Arg-126 to about VaI-136 in SEQ ID NO:2, from about Thr-142 to about Glu-148 in SEQ ID 
NO:2, from about Ala-31 to about Thr-46 in SEQ ID NO:4, from about Phe-57 to about 
Thr-1 17 in SEQ ID NO:4, from about Cys-132 to about Thr-175 in SEQ ID NO:4, from 
about Gly-185 to about Thr-1 94 in SEQ ID NO:4, from about Val-205 to about Asp-217 in 
SEP ID NO:4, from about Pro-239 to about Leu-264 in SEQ ID NO:4, and from about Ala- 
283 to about Pro-298 in SEQ ID NO:4, from about AIa-3l to about Thr-46 in SEQ ID NO:6. 
from about Phe-57 to about Gln-80 in SEQ ID NO:6, from about GIu-86 to about His- 106 in 
SEQ ID NO:6, from about Thr-1 08 to about Phe-1 19 in SEQ ID NO:6, from about His- 129 
to about VaI-138 in SEQ ID NO:6. and from about Gly-142 to about Pro-166 in SEQ ID 
NO:6 to about in SEQ ID NO:6. 

14. A method for making a recombinant vector comprising inserting an isolated 
nucleic acid molecule of claim 1 into a vector. 
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1 5. A recombinant vector produced by the method of claim 14. 

1 6. A method of making a recombinant host cell comprising introducing the 
recombinant vector of claim 1 5 into a host cell. 

.17: A recombinant host cell produced by the method of claim 16. 
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1 8. A recombinant method for producing a TNFR polypeptide, comprising 
culturing the recombinant host cell of claim 1 7 under conditions such that said polypeptide is ^Vfr 
expressed and recovering said polypeptide. *" ' v 

1 9. An isolated TNFR polypeptide comprising an amino acid sequence at least 

95% identical to a sequence selected from the group consisting of: ^ ^ 

(a) the amino acid sequence of a full-length TNFR polypeptide having the 
complete amino acid sequence shown in SEQ ID NO:2, 4 or 6, or as encoded by a cDN A 
clone contained in ATCC Deposit No. 97798, 97810 or 97809; 

(b) the amino acid sequence of a mature TNFR polypeptide having the amino acid 
sequence at positions 27-259 in SEQ ID NO:2, 31-300 in SEQ ID NO:4, or 31-170 in SEQ 

ID NO:6, or as encoded by a cDNA clone contained in ATCC Deposit No. 97798, 97810 or ; 

97809; or ^T* 

(c) the amino acid sequence of a soluble extracellular domain of a TNFR •* 
polypeptide having the amino acid sequence at positions 27 to 240 in SEQ ID NO:2. 3 1 to v 
283 in SEQ ID NO:4, or 3 1 to 1 66 in SEQ ID NO:6, or as encoded by the cDN A clone *-jg 
contained in ATCC Deposit No. 97798, 978 1 0 or 97809. 
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20. An isolated polypeptide comprising an epitope-bearing portion of the TNFR 
protein, wherein said portion is selected from the group consisting of a polypeptide 
comprising amino acid residues: from about Gln-42 to about Glu-52 in SEQ ID NO:2. from 
about His-58 to about Cys-66 in SEQ ID NO:2, from about Pro-68 to about Thr-76 in SEQ 
ID NO:2, from about Ser-79 to about Cys-85 in SEQ ID NO:2, from about Cys-91 to about 
Thr-102 in SEQ ID NO:2, from about Gln-1 10 to about Pro- 122 in SEQ ID NO:2. from 
about Arg-126 to about Val-136 in SEQ ID NO:2, from about Thr-142 to about Glu-148 in 
SEQ ID NO:2, from about Ala-31 to about Thr-46 in SEQ ID NO:4, from about Phe-57 to 
about Thr-1 17 in SEQ ID NO:4, from about Cys-132 to about Thr-175 in SEQ ID NO:4, 
from about Gly-185 to about Thr-1 94 in SEQ ID NO:4, from about Val-205 to about Asp- 
217 in SEQ ID NO:4, from about Pro-239 to about Leu-264 in SEQ ID NO:4, and from about 
Ala-283 to about Pro-298 in SEQ ID NO:4, from about Ala-31 to about Thr-46 in SEQ ID 
NO:6, from about Phe-57 to about Gln-80 in SEQ ID NO:6, from about Glu-86 to about His- 
106 in SEQ ID NO:6, from about Thr-108 to about Phc-1 19 in SEQ ID NO:6, from about 
His-129 to about VaI-138 in SEQ ID NO:6, and from about Gly-142 to about Pro-166 in SEQ 
ID NO:6 to about in SEQ ID NO:6. 



21 . An isolated antibody that binds specifically to a TNFR polypeptide of claim 
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22. An isolated nucleic acid molecule comprising a polynucleotide having a 
sequence at least 95% identical to a sequence selected from the group consisting of: 

(a) the nucleotide sequence of clone HPRCB54R (SEQ ID NO: 19); 

(b) the nucleotide sequence of clone HSJAU57RA (SEQ ID NO:20); 

(c) the nucleotide sequence of clone HELBP70R (SEQ ID NO:21>: 

(d) the nucleotide sequence of clone HUSCB54R (SEQ ID NO:22); 
(c) the nucleotide sequence of clone 11ELD106R (SEQ ID NO:23); 

(0 the nucleotide sequence of clone HCEOW38R (SEQ ID NO:24); and 
(g) a nucleotide sequence complementary to any of the nucleotide sequences in 
(a), (b), (c), (d), (e). or (f) above. 





Abstract 

The present invention relates to novel Tumor Necorsis Factor Receptor proteins. In 
particular, isolated nucleic acid molecules are provided encoding the human TNFR-5, -6a & - 
6p proteins. TNFR-5, -6a & -6p polypeptides are also provided as are vectors, host cells 
and recombinant methods for producing the same. The invention further relates to screening 
methods for identifying agonists and antagonists of TNFR-5, -6a & -60 activity. Also 
provided are diagnostic methods for detecting immune system-related disorders and 
therapeutic methods for treating immune system-related disorders. 
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Piffur« 1 

CCTCTCCACGCGCACGAACTCAGCCAACGATTTCTGATAGATTTTTGGGAGTTTGACCAG 
AGATGCAAGGGGTGAAGGAGCGCTTCCTACCGTTAGGGAACTCTGGGGACAGAGCGCCCC 
GGCCGCCTGATGGCCGAGGCAGGGTGCGACCCAGGACCCAGGACGGCGTCGGGAACCATA 

CCATGG' :r CGG ATCCCC AAG ACCCTAAAGTTCGTCGTCGTC ATCGTCGCGGTC CTGCTGC 
HARIPKTT, KFVVVI V A Y L *n -2 

CAGTCCTAGCTTACTCTGCCACCACTGCCCGGCAGGAGGAAGTTCCCCAGCAGACAGTGG 
V L A Y S A TTARQEEVPQQTVA 

CCCCACAGCAACAGAGGCACAGCTTCAAGGGGGAGGAGTGTCCAGCAGGATCTCATAGAT 
PQQQRHSFKGEECPAGSHRS 

CAGAACATACTGGAGCCTGTAACCCGTGCACAGAGGGTGTGGATTACACCAACGCTTCCA 
EHTGACNPCTEGVDYTITASN 

AC AATG AAC C TTC TTGCT T C CC ATGT AC AGTTTGT AAATC AG ATC AAAAAC A1 AAAAGTT 
NEPSCFPCTVCKSDQKH KSS 

CCTGCACCATGACCAGAGACACAGTGTGTCAGTGTAAAGAAGGCACCTTCCGG AATG AAA 
CTMTRDTVCQCKEGTFRNEN 

ACTCCCCAGAGATGTGCCGGAAGTGTAGCAGGTGCCCTAGTGGGGAAGTCCAAGTCAGTA 
SPEMCRKCSRCPSGE VQVSN 

ATTGTACGTCCTGGGATGATATCCAGTGTGTTGAAGAATTTGGTGCCAATGCCACTGTGG 
CTSWDDIQCVEEFGANATVE 

AAACCCCAGCTGCTGAAGAGACAATGAACACCAGCCCGGGGACTCCTGCCCCAGCTGCTG 
T PAAEETMNTS PGTPAPAAE 

AAG AG AC AATG AAC ACC AGC C C AGGG AC TCCTGCC CC AGCTGCTG AAG AG AC AATG AC C A 
ETMNTSPGTPAPAAEETMTT 

C C AGCCCGGGG AC TC C TG C C C C AGCTGCTG AAG AG AC AATG ACC ACC AGCCC GGGGACTC 
S PGTPAPAAEETMTTS PGTP 



CTGCCCC AGCTGCTG AAG AG AC AATG ACC ACC AGCCCGGGGACTCCTGCCTCTTCTC ATT 
APAAEETMTTSPGTPASSHY 
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Figure 1 (continued) 



ACCTCTCATGCACCATCGTAGGGATCATAGTTCTAATTGTGCTTCTGATTGTGTTTGTTT 
LSCTIVGIIVLIVLLIVFV* 

GAAAGACTTCACTGTGGAAGAAATTCCTTCCTTACCTGAAAGGTTCAGGTAGGCGCTGGC 
TGAGGGCGGG3GGCGCTGGACACTCTCTGCCCT<XrCTCCCTCTGCTGTGTTCCCACAGAC 
AGAAACGCCTGCCCCTGCCCCAAGTCCTGGTGTCTCCAGCCTGGCTCTATCTTCCTCCTT 
GTGATCGTCCCATCCCCACATCCCGTGCACCCCCCAGGACCCTGGTCTCATCAGTCCCTC 
TCCTGGAGCTGGGGGTCCACACATCTCCCAGCCAAGTCCAAGAGGCAGGGCCAGTTCCTC 
CCATCTTCAGGCCCAGCCAGGCAGGGGGCAGTCGGCTCCTCAACTGGGTGACAAGGGTGA 
GGATGAGAAGTGGTCACGGGATTTATTCAGCCTTGGTCAGAGCAGAACACAGAGATTTTC 
CGTGAAAAAAAA 
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GCTCTCCCTGCTCCAGCAAGGACCATGAGGGCGCTGGAGGGGCCAGGCCTGTCGCTGCTG 

MBALEGPGLSLL 

TGCCTGGTGTTGGCGCTGCCTGCCCTGCTGCCGGTGCCGGCTGTACGCGGAGTGGCAGAA 
riVL ALPAT.T, PVPAVRGV A E 

ACACCCACC: ACCCCTGGCGGGACGCAGAGACAGGGGAGCGGCTGGTGTGCGCCCAGTGC 
TPTYPWRDAETGERLVCAQC 

CCCCCAGGCACCTTTGTGCAGCGGCCGTGCCGCCGAGACAGCCCCACGACGTGTGGCCCG 
PPGTFVQRPCRRDSPTTCGP 

TG TCC AC C GC GCC ACT AC AC GC AG TTCTGG AACT AC CTGGAGC GCTGC CGCT ACTGC AAC 
CPPRHYTQFWNYLERCRYCN 

GTCCTCTGCGGGGAGCGTGAGGAGGAGGCACGGGCTTGCCACGCCACCCACAACCGTGCC 
VLCGEREEEARACHATHNRA 

TGCCGCTGCCGCACCGGCTTCTTCGCGCACGCTGGTTTCTGCTTGGAGCACGCATCGTC 
CRCRTGFFAHAGFCLEH ASC 

CCACCTGGTGCCGGCGTGATTGCCCCGGGCACCCCCAGCCAGAACACGCAGTGCrAGCCG 
PPGAGVIAPGTPSQNTQCQP 

TGCCCCCCAGGCACCTTCTCAGCCAGCAGCTCCAGCTCAGAGCAGTGCCAGCCCCACCGC 
CPPGTFSASSSSSEQCQPHR 

AACTGCACGGCCCTGGGCCTGGCCCTCAATGTGCCAGGCTCTTCCTCCCATGACACCCTG 
NCTALGLALNVPGSSSHDTL 

TGCACCAGCTGCACTGGCTTCCCCCTCAGCACCAGGGTACCAGGAGCTGAGGAGTGTGAG 
CTSCTGFPLSTRVPGAEECE 

CGTGCCGTCATCGACTTTGTGGCTTTCCAGGACATCTCCATCAAGAGGCTGCAGCGGCTG 
RAVI DFVAFQDI SIKRLQRL 

CTGCAGGCCCTCGAGGCCCCGGAGGGCTGGGGTCCGACACCAAGGGCGGGCCGCGCGGCC 
LQALEAPEGWGPTPRAGRAA 

TTGCAGCTGAAGCTGCGTCGGCGGCTCACGGAGCTCCTGGGGGCGCAGGACGGGGCGCTG 
LQLKLRRRl-TELLGAQDGAL 

CTGGTGCGGCTGCTGCAGGCGCTGCGCGTGGCCAGGATGCCCGGGCTGGAGCGGAGCGTC 
LVRLLQALRVARMPGLER SV 

CGTGAGCGCTTCCTCCCTGTGCACTCATCCTGGCCCCCTCTTATTTATTCTACATCCTTG 
RERFLPVH * 

GCACCCCACTTGCACTGAAAGAGGCTTTTTTTTAAATAGAAGAAATGAGGTTTCTTAAAG 
CTTATTTTTATAAAGCTTTTTCATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



I*- 




Figure 3 



59363 U.S. PTO 

60035496 

■llfllll 

01/14/97 



am 



TGGCATGTCGGTCAGGCACAGCAGOGTCCTGTGTCCGCGCTGAGCCGCGCTCTCCCTGCT 

CCAGCAAGGACCATGAGGGCGCTGGAGGGGCCAGGCCTGTCGCTGC'^TGCCTGGTGTTG 
m p A L p r, P n T, s L L C — L V L 

GCGCTGCCTGCCCTGCTGCCGGTGCCGGCTGTACGCGGAGTGGCAGAAACACCCACCTAC 
ft t. P A ». P V P ft V B G V A E T P T Y 

CCCTGGCGGGACGCAGAGACAGGGCAGCGGCTGGTGTGCGCCCAGTGCCCCCCAGGCACC 
PWROAE'TGERLVCAQCPPGT 

TTTGTGCAGCGGCCGTGCCGCCGAGACAGCCCCACGACGTGTGGCCCGTGTCCACCGCGC 
FVQRPCRRDSPTTCGPCPPR 

CACTA r * ACGCAGTTCTCiGAACTACCTGGAGCGCTGCCCCTACTGCAACGTCCTCTGCGGG 
I Y ~T Q F W N Y lerc rycnvlcg 

GAGCGTCAGGAGGAGGCACGGGCTTGCCACGCCACCCACAACCGTGCCTGCCGCTGCCCC 

ereeearachathnracrcr 

ACCGGCTTCTTCGCGCACGCTGGTTTCTGCTTGGAGCACGCATCGTGTCCACCTGGTGCC 
7GFFAHACFCLEHASCPPGA 

GGCGTGATTGCCCCGGGTGAGAGCTGGGCGAGGGGAGGGGCCCCCAGGAGTGGTGGCCGG 
GVIAPGESWARGGAPRSGGF 

AGGTGTGGCAGGGGTCAGGTTGCTGGTCCCAGCCTTGCACCCTGAGCTAGGACACCAGTT 
F CGRGQVAGPSLAP- 

CCCCTGACCCTCTTCTTCCCTCCTGGCTGCAGGCACCCCCAGCCAGXACACGCAGTGCCA 
G: rCTGCCCCCCAGGCACCTTCTCAGCCAGCAGCTCCAGCTCAGAGCAGTGCCAGCCrCA 
rCGCAAClCCACGGCCCT^GGCCTGGCCCTCAATGTGCCAGGCTCTTCCTCCCATGACAC 
CCTGTGCACCAGCTGCACTGCCTTCCCCC7CA.3CACCAGGGTACCAGGTGAGCCAGAGGC 
CTGAGGGGGCAGC'ACACTGCAGGCCAGGCCCACTTGTGCCCTCACTCCTGCCCCTGCACG 
TGCATCTAGCCTGAGGCATGCCAGrTGGCTCTGGGAAGGGGCCACAGTGGATTTGAGGGG 
- JAGGGfiTCCCTCCACTACATCCCCACCAAGTCTCCCCTCTCAGGGGTGGCTGAGAATTT 



P 



GGATCTGAGCCAGCGCACAGCCTCCCCTGGAGAGCTCTGGGAAAGTGGGCAGC 
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Piffura 3 (continued) 

TAACTGCCCGAGGGGAAGGTGGCTGGCTC CTCTGAC ACGGGGAAACCGAGGCCTGATGGT 
AACTCTCCT AACTGCCTGAGAGGAAGGTGGCTGCCTCCTCTGAC ATG CGGAAACCGAGGC 
CC AATCTTAACC ACTGTTGAGAAGTCACAGGGGGAAGTC«\CCCCCTTAAC ATC AACTCAG 
GTCCGGTCCATCTGCAGGTCCCAACTCGCCCCTTCCGATGOCCCAGGAGCCCCAAGCCCI' 
TGCCTOGGCCCCCTTGCCTCTTX^AGCCAAGGTCCGAGTGGCCGCTCCTGCCCCCTAGGC 
CTTTGCTCCAGCTCTCTGACCGAAGGCTCCTGCCCCTTCTCCAGTCCCCA 
CCCCTCTCCAGCACGGCTCACTGCACAGGGAr I TC TC TC T CCTGC A VACCCCCCGAGTGG 
GGC^CAGAAAGCAGGGTACCTGGCAGCCCCCGCCACTGTGTGTGGGTGAAATCATCGGAC 

CGC~GCCTCCCCACCCCACIX>CAGGAGCT^ 

TC "»TTTTCCAGGACATCTCCATCAAGAGGAGCGGCTGCTGCAGGCCC 
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Figure 6 
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Figure 7 
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5»3«3 U.S. PTO 



BZX.DX06K 

iGCACGAGCA 
TGAGGGCGCT 
1GCTGCCGGT 
NAGAGACAGG 
^TGCCGNCG 
. . iXJGAACTAC 
J 3C ACGGGTT 
N'GGCAAGTTG 
TNNNCNCGGC 



GGGTCCTGTN 
GGAGGGGCCA 
GCCGGCTGTA 
GGAGCGGCTG 
AGACACCCCC 
CTGGAGC.V-i- 
TNCCACGNCA 
GTTTTTNNTT 
AAACTNAAA 



TCCGCCCTGA 
GGCCTGTCGC 
CGCGGAGTGG 
GTGTNTNCCC 
ACGACGTGTG 
CNCGTTACTN 
ACCACAACCG 
TGGAGNAAGG 



GCCGCGCTCT 
TGCTGTGCCT 
CAGAAACACN 
ANTGCCCCCC 
GCCCGTNTCC 
CAACGTCCTC 
NGGNTTACCG 
ATTCGTGTTN 



NCCTGCTCCA 
GGTGTTGGCG 
NACNTACCCC 
AGGCACCTTT 
A<~CGCGCCAC 
TGCGGGGAGC 
TNGCCGNACC 
CAATTNATTG 



GCAAGGACCA 
CTGCCTGCCC 
TGGCGGGACG 
NTGCAGCGGC 
TAC^CGCATT 
OTNAGGAGGA 
GGTTTCTTCG 
ACGNAGTGAT 



^CAACTGCA CGGCCCTGGG ACTGGCCCTC AATGTGCCAG GNTCTTCCTC CCATGACACC 
r-GTGCACCA GCTGCACTGG CTTCCCCCTC AGCACCAGGG TACCANGAGC TGAGGAGTGT 
•-«GCNTGCCG TCATCGACTT TT rGGCTTTC CAGGACATCT CCATCAAGAG GCTGCAGCGG 

♦GCTCANGC C 



4FRCB54R 

aJVTTCGGCA 
AGTTTGAC 
.vCAGICNCGCC 
TCGGGAACCA 
;GTC r TGCT 
-CAGN^ANTG 
; ;TTTTTATA 
"TAANGTTTC 
AATTTNTNA 

MSJAU57RA 

r.CCAGAGGTG 
TGTGCA :CC 
PCTCCCAGC 
iGGGGGCAG 

*.TT"~ATTCAG 
.NTTCCCCTT 



NAGCCTCTCC 
C AG AG ATGC A 
CCGGCCGCCT 
TACCATGGCC 
GCCAGTCCTA 
GNCCCACAGC 
GTTCAGAACN 
C AAN ATG \* AC 
ACCTTNCCNG 



ACGCGCACGA 
AGGGGTGAAG 
GATGGCCGAG 
CGGATCCCCA 
GCT7ACTCTG 
AACAGNGGCA 
TATTGGNGCT 
TTTTTNGTTC 
GGNAAATT 



TCTCCAGCCT 
CCCAGGACCC 
CAAGTCCAAG 
TCGGCTCCTC 
CCTTGGTCAG 
CTTNATNCCC 



GGCTCTATCT 
T GGTCTC ATC 
AGGGCAGGGC 
AACTGGGTGA 
AGCAGAACAC 
CTTTCN 



ACTCAGCCAA 
GAGCGCTTCC 
GCAGGGTGCG 
AG AC CCT AAA 
CCACCACTGC 
CAGTTTCAAG 
NTNAACCCTT 
CCTGTTANAT 



TCCTCCTTGT 
AGTCCCTCTC 
C\GTTCCTCC 
C AAGGGTG AG 
AGAGTTTTTC 



CGATTTCTGA 
TACCGTTAGG 
ACCCAGGACC 
GTTCGTGGTC 
CCGGCAGAGG 
GGGGNAGGAG 
GCACAAGGGT 
TTTTTAATTA 



TAG ATT TT TG 
AACTCTGGGG 
CAGGACGGCG 
GTCATCGTCG 
AAGTTNCCCA 
TTTTCCANC \ 
TTGGNTTAAA 
GTTNAAANTT 



NATCGTCCCA TCCCCACATC 
CTGGAGCTGG GGGTCCACAC 
CATCTTCAGG CCCAGCCAGG 
GATGAGAAGT GG7CACGGGG 
CGTGTGTTGG TTTTTACTCT 



v. 
\ 



.^CAGACGCC 
: JATTACCTC 
TGTTTGAAA 

NGCCTGGCTG 
"CCCACAGA 

' TATCTTNNC 



CCAGCTGCTG 
TNATGCACCA 
AGANTTCACT 
AGGGCGGGGG 
CAGAAACGCC 
TCCTTGTGAA 



AAGAGACAAT 
TCGTAGGGAT 
GTGGAAGAAA 
GCGCTGGTAC 
TGCCCCTGNC 
TCGTTCCCAT 



AATCACCAGC 
CATAGTTCTA 
TTCCTTCCTT 
ACTCTCTGAC 
CCCAAGTTCC 
CCCCACANGC 



CCGGGGACTC 
ATTGTGCCTT 
ACCTGTAAGT 
CCTGCCTCCC 
TNGTGTTTTC 



CTGN*NTCTNC 
CTAATTGTTT 
TNCAGGTAQG 
TCTGNCTGTT 
CAGCCTGGCT 



"CAGGGTCTC CTNCCCCACC TGCTGAAGAG ACANTGACCA CCAGCCCGGG GACTCCTGCC 
^ rTCAT TACC TCTNAT GNANCATCGT AGGGATCATA GTTCTAATTG TGCCTTCTGA 
ATTG"GCTTT GTTTGGAAAG ACTTCACTGT GGGAAGAAAT TCCTTCCTTA CCTGAAGTTG 
CAGGT*GCCC CTGGGTNAGG GCGNGGGCCG CTGGACANTN TCTGGNCCTG GCTGCCCGCT 



m • 



